(19) 



J 



Europaisches Patentami 
Europ an Patent Office 
Offic europ en des brevets 




(12) 



EUROPEAN PATENT APPLICATION 



EP 1 035 207 A1 

2^ 



(43) Date of publication: 

13.09.2000 Bulletin 2000/37 

(21) Application number: 99104664.0 

(22) Date of filing: 09.03.1999 



(51) IntCI 7 : C12N 15/53, C12N 15/11, 
C12N 15/85, C12N 9/02, 
C12N 5/10, C12Q 1/02, 
C07K 16/40, A61K 39/395, 
A61 K 38/44, A01 K 67/027, 
G01N 33/50, G01N 33/53 



7* 



(84) Designated Contracting States: 

AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 
MC NL PT SE 

Designated Extension States: 
AL LT LV MK RO SI 

(71) Applicant: MultiGene Biotech GmbH 
97074 WOrzburg (DE) 

(72) Inventors: 

• Weber, Bernhard H.F. 
97218 Gerbrunn (DE) 



• Marquardt, Andreas 
97218 Gerbrunn (DE) 

(74) Representative: Schmidt, Werner, Dr. 
Robert-Bunsen-Strasse 15 
65929 Frankf urt am Main (DE) 

Remarks: 

The applicant has subsequently filed a sequence 
listing and declared, that it includes no new matter. 



(54) cDNA molecules of the members of gene family encoding human fatty acid desaturases and 
their use in diagnosis and therapy 



(57) The present invention relates to the cloning and 
sequencing of the cDNA molecules of three members 
of a gene family encoding three human fatty acid desat- 
urases, latty acid desaturase-1 (FADS 1), fatty acidde- 
saturase-2 (FADS2) and fatty acid desaiurase-3 
(FAD S3). The invention also relates to diagnostic meth- 



ods of screening for and detection of FADS1 , FADS2, 
FADS3 and gene therapy utilizing recombinant DNA as 
well as the generation of animal models (knock-in, 
knock-out, transgenic animals), anti-FADSI, anti- 
FADS2, anti-FADS3 antibodies and use in screenings 
for modulating drugs. 
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Description 

Field of the invention 

5 [0001] The present invention relates to the cloning and sequencing of the cDNA molecules of three members of a 
gene family encoding three human fatty acid desaturases, fatty acid desaturase-1 (FADSl), fatty acid desaturase-2 
(FADS2) and fatty acid oesaturase-3 (FADS3). The invention also relates to diagnostic methods o1 screening for and 
detection of FADS1 , FADS2, FADS 3 and gene therapy utilizing recombinant DNA as well as the generation of animal 
models (knock-in, knock-out, transgenic animals), anti-FADSl , anti-FADS2, anti-FADS3 antibodies and use in screen- 

to ings for modulating drugs. 

Background of the Invention 
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[0002] Cellular membranes are dynamic structures in which variable amounts of proteins are embedded in a lipid 
bilayer whose hydrophobic characteristics are largely due to fatty acid moieties of complex lipids (Singer and Nicolson 
1972). The fluidity' of the membranes are achieved by incorporating unsaturated fatty acyl chains of varying lengths 
and varying degrees of unsaturation into the lipids (Stubbs and Smith 1 984). In animals, some of the unsaturated fatty 
acids need to be supplied by the diet ('essential polyunsaturated fatty acids') but, in part, can also be synthesized de 
novo by oxidative desaturation (i.e. formation of double bonds) of saturated fatty acids of plant and animal origin 
Polyunsaturated fatty acid formation requires acetyl-CoA dependent chain elongation and desaturation. Most mam- 
malian tissues can modily acyl chains by introducing more than one double bond with the first one generally at the A- 
9 position between carbons C-9 and C-10. Subsequent double bonds may then be inserted at the A-4, A-5, and A-6 
positions by individual desaturase activities (Cook 1 991 ). 

[0003] For the two mapr precursors of the (n-6) and (n-3) series of polyunsaturated fatty acids, linoleic 18:2(n-6) 
and alpha-linolenic 18:3(n-3) acids, animals depend entirely on their dietary intake. By alternating sequences of de- 
saturation (involving the subsequent action of A4, A5- and A6-desatu rases, respectively) and C2 chain elongation 
linoleic and alpha-linolenic acids are utilized to form arachidonic acid, 20:4(n-6), and the (n-3) acyl chains eicosapen- 
taenoic acid, 20:5(n-3), and docosahexaenoic acid, 22:6(n-3), respectively (Cook 1991). 

[0004] Linoleic and arachidonic acid are the only members of the (n-6) family that accumulate in large quantities in 
Irver and most other animal tissues. The intermediates 18:3(n-6) and 20:3(n-6) are lormed from 18 2(n-6) by A6-de- 
saturation, chain elongation and A5-desaturation (Horrobin 1993). As a component of phospholipids arachidonic acid 
is abundant in cellular membranes but also serves as the primary precursor of oxygenated derivatives such as pros- 
taglandine E2 which is pro- inflammatory and regulates cell function of the immune system. 

[0005] The (n-3) acyl chains eicosapentaenoic acid [20:5(n-3)] and docosahexaenoic acid (22:6(n-3)] are most abun- 
dant in cerebral cortex, letina, and spermatozoa. Although it is generally assumed that the liver is the major source of 
22:6(n-3), rt has been shown that docosahexaenoic acid can also be produced by retinal pigment epithelium (Wang 
and Anderson 1993) as well as brain astrocytes (Moore et al. 1991, Delton-Vandenbrouke et al. 1997) In retinal rod 
outer segments, phospholipids may contain 40-60% of 22:6(n-3) which can markedly influence membrane fluidity due 
to the presence of six double bonds. 

[0006] In recent years there has been increasing interest in the role of polyunsaturated fatty acids in the pathobiology 
of a number of chronic conditions such as coronary and peripheral vascular disease (Horrobin 1 995). acute and chronic 
inflammatory immune responses (Calder 1998, Fan and Chapkin 1998, Grimble and Tappia 1998). cutaneous abnor- 
malities (Horrobin 1989. Grattan etal. 1990), essential hypertension (Russo etal. 1997, Chi and Gupta 1998) diabetes 
mellrtus (Mori et al. 1997), asthma (Leichsenring et al. 1995, Villani et al. 1998, Hodge et a!. 1998) and rheumatoid 
arthritis (James and Cleland 1997, Ariza-Ariza et al. 1998, Grimble and Tappia 1998). A particular role has been at- 
tributed to gamma-linolenic acid [18:3(n-6)] as an anti-cancer polyunsaturated fatty acid. It has been shown that 18 3 
(n-6) confers anticancer properties by a variety ol mechanisms such as (i) up-regulation of E-cadherin, a cell-cell 
adhesion molecule which acts as a suppressor of metastasis (Jiang et al. 1 995), (ii) regulation of desmosome-mediated 
cell-ceil adhesion in human cancer cells (Jiang et al. 1997a), (iii) up-regulation of the metastasis-suppressor gene nm- 
23 thus contributing to the inhibition of the in vitro invasion of tumor cells (Jiang et al. 1998a), (iv) up-regulation of 
maspin expression, a mammary serine protease inhibitor, with profound effects on motility of cancer cells (Jiang et al 
1 997b) and (v) finally inhibition of cell cycle progression via regulation of phosphorylation and subsequent degradation 
of cell cycle inhibitors p27kip1 and p57kip2 (Jiang et al. 1 998b). 

[0007] To further understand lipid-related function in human health and disease additional research into fatty acid 
biosynthesis and metabolism is required. In particular, we need to understand the pharmacological properties, the 
mechanisms of action and the tissue-specific regulation of composition of the polyunsaturated fatty acids and their 
metabolites. This will provide additional insight into the role of the polyunsaturated fatty acids in various chronic disease 
states and will make it feasible to focus pharmacogenomic research on drug design and valuation with the goal of 
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ameliorating acute hearth problems associated with impaired lipid function. As a prerequisite, the genes and their gene 
products involved in the above-mentioned processes need to be identified and characterized. 

[0008] It is the objective of the present invention to provide cDNA molecules of three novel members of the human 
membrane fatly acid desaturase gene family, termed FADS1 , FADS2 and FADS3. The thr e genes share a nucleic 
acid identity of approximately 50-60% and an amino acid identity of about 77% with each other. Similar to other mem- 
brane-bound desaturases from mammals, fungi, insects, plants and cyanobacteria FADS1 , FADS2 and FADS3 reveal 
a hydropathy profile typical of membrane-bound desaturases and share three regions of highly conserved primary 
sequence of the general histidine motif HX 2(3) [XH]H (Shanklin et al. 1994). The histidine residues may act as metal- 
chelating ligands involved in the binding of oxygen in the reaction center (Shanklin et al. 1 995). Together, these features 
confirm FADS1 , FADS2 and FADS3 as novel members of the desaturase family of fatty acyl chain -modifying enzymes. 
[0009] Amino acid identity of FADS1, FADS2 and FADS3 to known desaturases (e.g. from Arabidopsis thaliana, 
Bmssica napus, Synechocystis spec., Borago officinalis, Helianthus annuus, Saccharomyces cerevisiae and 
Caenorhabditiselegans) is restricted to the respective carboxy terminal regions (amino acid positions 260 to 422) re- 
vealing an overall sequence identity of approximately 27%. Interestingly, the respective amino-termini of the three 
novel proteins demonstrate similarities to cytochrome b5 (amino acid positions 4 to 75; Fig. 1). Cytochrome b5 is a 
small hemoprotein and functions as an intermediate donor in a number of oxidation/reduction reactions including e.g. 
the NADH-dependent A9 stearoyl-CoA desaturation (Strittmatter et al. 1974) or the A5 desaturation in cholesterol 
biosynthesis (Reddy et al. 1977). From the amino acid alignments we conclude that FADS1, FADS2 and FADS3 are 
fusion proteins consisting of a N-terminal cytochrome b5 and a C-terminal desaturase-like enzyme. From a functional 
point of view this fusion of two activities may increase the efficiency of electron transport required for desaturation by 
covalently bringing together the presumed electron donor (cytochrome b5) and its putative acceptor (desaturase-like 
enzyme) Other heme fusion proteins containing the cytochrome b5 domain have been identified and represent a 
superfamily of fused proteins (Guiard and Lederer 1979). Besides others this superfamily includes the yeast flavocy- 
tochrome b 2 , sulfite oxidase, nitrate reductase, the yeast A9 acyl-CoA desaturase and more recently the sunflower 
cytochrome b5-desaturase fusion protein (Sperling et al. 1995). The three novel desaturase-like enzymes reported 
herein, FADS1, FADS2 and FADS3, can be added to the growing list of members of this superfamily of fused proteins 
(Fig. 2). 



Summary of the invention 

[001 0] The eukaryotic fatty acid desaturases represent a group of iron-containing enzymes that catalyze NAD(P)H- 
and 0 2 -dependent introduction of double bonds into fatty acyl chains. Impairment of desaturase activities has been 
implicated in a variety of human conditions including liver disease, coronary artery disease and cancer. With the present 
invention we are providing three isolated human cDNA molecules that encode three novel members of a cytochrome- 
b5-containing fusion protein with similarity to plant and lower animal desaturase enzymes, termed fatty acid desaturase- 
1 (FADS1) (represented by Fig. 3 and SEQ ID NO. 1), fatty acid desaturase-2 (FADS2) (represented by Fig. 4 and 
SEQ ID NO. 2) and fatty acid desaturase-3 (FADS3) (represented by Fig. 5 and SEQ ID NO. 3). 



FADS1 protein 

[0011] MAPDPVAAETAAQGPTPRYFTWDEVAQRSGCEERWLVIDRKVYNISEFTRRHPGGS RVISHYAGQDATDP- 
FVAFHINKGLVKKYMNSLLIGELSPEQPSFEPTKNKELTDEFREL RATVERMGLMKANHVFFLLYLLHILLLDGAAWLTL- 
WVFGTSFLPFLLCAVLLSAVQAQA GWLQHDFGHLSVFSTSKWNHLLHHFVIGHLKGAPASWWNHMHFQHHAKPNC- 
FRKD PDINMHPFFFALGKILSVELGKQKKKYMPYNHQHKYFFLIGPPALLPLYFQWYIFYFVIQ RKKWVDLAWMITFY- 
VRFFLTY VPLLGLKAFLGLFFIVRFLESNWFVWVTQMNHIPMHlD HDRNMDWVSTQLQATCNVHKSAFNDWFSGHL- 
NFQIEHHLFPTMPRHNYHKVAPLVQ SLCAKHGIEYQSKPLLSAFADIIHSLKESGQLWLDAYLHQ 



FADS2 protein 

[0012] MGKGGNQGEGAAEREVSVPTFSWEEIQKHNLRTDRWLVIDRKVYNITKWSIOHPGG QRVIGHYAGEDAT- 
DAFRAFHPDLEFVGKFLKPLLIGELAPEEPSQDHGKNSKITEDFRA LRKTAEDMNLFKTNHVFFLLLLAHIIALESIA- 
WFTVFY FGNGWIPTLITAFVLATSQAQAG WLQHDYGHLSWRKPKWNHLVHKFVIGHLKGASANWWNHRHFQHH- 
AKPNIFHKDPD VNMLHVFVLGEWQPIEYGKKKLKYLPYNHQHEYFFLIGPPLLIPMYFQYQIIMTMIVHKN WVDLA- 
WAVSYYIRFFITYIPFYGILGALLFLNFIRFLESHWFVdNTQMNHIVMEIDQEAY RDWFSSQLTATCNVEQSFFNDWFS- 
GHLNFQIEHHLFPTMPRHNLHKIAPLVKSLCAK HGI E YQE KPLLRALLDII RSLKKSGKLWLDAYLHK 
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FADS3 proiein 

[0013] MGGVGEPGPREGPAQPGAPLPTFCWEOIRAHDQPGDKWLVIERRVYDISRWAQRHP GGSRLIGHHGAE- 
°*™FRAF^ FRALHQAAEDMKLFDASPT F FAFLLGHI LAM- 

E VLA WL L I Y L L G PG WVPS AL AAF I LAI SQ AQSWCLQHDLGHASIFKKSWWNHVAQKFVMGQLKGFSAHWWNFRH- 
FQHHAKPNIF HKDPDVTVAPVFLLGESSVEYGKKKRRYLPYNQQHLYFFLIGPPLLTLVNFEVENLAY MLVCMQWA- 
DLLWAASFYARFFLSYLPFYGVPGVLLFFVAVRVLESHWFVWITQMNHI PKEIGHEKHRDWVSSQLAATCNVEPSLF- 
TNWFSGHLNFQIEHHLFPRMPRHNYSRVAPLVKSLCAKHGLSYEVKPFLTALVDIVRSLKKSGDIWLDAYLHQ 
[001 4] si udies to clarify the specificity and the subcellular location of these ubiquitiously expressed fusion proteins 
are in prog, ess. Also, the detailed cellular functions and dysfunctions of the desaturase-like domains are being inves- 
tigated in appropriate cellular and animal systems. This will address the question whether and to which extent these 
novel enzymes are involved in human disease. The invention encompasses the three cDNA molecules FADS1 FADS2 
and FADS3, the nucleotide sequence of these cDNAs, and the putative amino acid sequences of ihe FADS1 (repre- 
sented by Fig.t B and SEQ ID NO. 4), FADS2 (represented by Fig. 7 and SEQ ID NO. 5), and FADS3 represented by 
7 * Fig. 8 and SEQ ID NO. 6) proteins. 

[001 5] Also comprehended by this invention are oligonucleotide primers comprising the cDNA molecule or its com- 
plementary strand allowing the amplification of FADS1 (represented by Fig. 9 and SEQ ID NOS 7-12) FADS2 (rep- 
resented by Fig. 9 and SEQ ID NOS. 13-18), and FADS3 (represented by Fig. 9 and SEQ ID NOS 19-22) by the 
reverse transcriptase polymerase chain reaction (RT-PCR). Such primers are particularly useful and will provide re- 
searchers and physicians with an enhanced ability to assess the role of FADS1 , FADS2, and FADS3 in human disease 
The present invention also relates to methods of screening for and detection of FADS1 , FADS2, and FADS3 mutation 
carriers including prenatal FADS1 , FADS2, and FADS3 screening and diagnosis 

[001 6] Having provided the isolated human FADS1 , FADS2, and FADS3 cDNA sequences, also comprehended by 
this invention are the FADS1 , FADS2, and FADS3 proteins, and derivatives thereof, in aspects of diagnosis and treat- 
ment of human disease. Finally, the invention pertains to proteins which comprise the same or substantially the same 
ammo acid sequence (at least 200 amino acids) as lhat represented by Figs. 6, 7, 8 and SEQ ID NOS 4 5 6 or a 
mof-^ ° f am,noac,d sequences having a deletion, addition or substitution of 1 to 10 amino acids or its salt 
[0017] Another aspect of the invention is the use of the FADS1 , FADS2, and FADS3 proteins as a target for drug 
and gene therapy in the Ireatment of human disease. This includes the generation and utilization ol FADS1 FADS2 
and FADS3- targeted animal models (knock-in, knock-out, transgenic animals) and anti-FADS1 -FADS2 and -FADS3 
antibodies that specifically detect Ihe FADS 1 , FADS2, and FADS3 proteins, respectively. 

[0018] The foregoing and other features and advantages of the invention will become more apparent from the fol- 
lowing detailed description and accompanying drawings. 

[001 9] One aspect of the invention are the isolated cDNAs selected from the group consisting of: 

(a) a polynucleotide having at least a 65 % homology, preferabV at least a 80 % homology with a polynucleotide 
encoding a polypeptide selected from the group consisting of the polypeptides of SEQ ID NO: 4. SEQ ID NO" 5 
and SEQ ID NO: 6; 

(b) a polynucleotide having at least a 65 % homology, preferably at least a 80 % homology with a polynucleotide 
which by virtue of the redundancy of the genetic code, encodes the same polypeptide selecled from the group 
consisting of the polypeptides of SEQ ID NO: 4, SEQ ID NO: 5 and SEQ ID NO: 6; 

(c) a DNA molecule capable of hybridization under stringent conditions lo a DNA molecule according to (a) or (by 

(d) a polynucleotide which is complementary to the polynucleotide of (a), (b) or (c); and 

(e) a oligonucleotide comprising at least 15 consecutive nucleotides of the polynucleotide of (a), (b), (c) or (d) 

(including DNAs which are synonymous to the DNAs of (a), (b), (c), (d) and (e) due to the degeneracy of the genetic 
especially isolated cDNAs selected from the group consisting of: 

(a) a polynucleotide having at least a 65 % homology, preferabV at least a 80 % homology with a polynucleotide 
sequence selected from the group consisting of the polynucleotides of SEQ ID NO: 1 , SEQ ID NO: 2 and SEQ ID 

(b) a DNA molecule capable of hybridization under stringent conditions to a DNA molecule according to (a)- 

(c) a polynucleotide which is complementary to the polynucleotide of (a) or (b); 

(d) a oligonucleotide comprising at least 15 consecutive nucleotides of the polynucleotide of (a) (b) or (cV and 

(e) a DNA which is synonymous to the DNAs of (a), (b), (c) or (d) due to the degeneracy of the genetic code. 

[0020] In the scope of the invention are polynucleotides having a polynucleotide encoding a polypeptide s lected 
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from the group consisting of the polypeptides of SEQ ID NO: 1 , SEQ ID NO: 2 and SEQ ID NO: 3 and polynucleotides 
having a polynucleotide sequence selected from the group consisting of the polynucleotides of SEQ ID NO: 4, SEQ 
ID NO: 5 and SEQ ID NO: 6, but DNAs comprising a nucleotide sequence with at least a 65 % homology with these 
nucleotide sequences is also within the scope of the invention. 
5 [0021] Furthermore within the scope of th invention are: 

[0022] A recombinant vector comprising the disclosed DNA molecules. 

[0023] Transgenic host cells such as COS7, fibroblast cell lines or any other tissue-specific cell lines, as well as a 
transgenetic host cell tranformed by the DNA or the vector, a corresponding transgenetic organism or a corresponding 
transgenetic knock-in or knock-out animal model. 

io [0024] Polypeptides and corresponding proteins comprising at least 65 %, preferably 85 %, especially 100 % of a 
polypeptide sequence selected from the group consisting of the polypeptides of SEQ ID NO: 1, SEQ ID NO: 2 and 
SEQ ID NO: 3; polypeptides comprising a polypeptide sequence with at least a 65 % homology with the said polypep- 
tides; peptides comprising at least 15, preferably 30, especially 60 consecutive amino acids of the said polypeptides; 
and polypeptides having substantially the same amino acid sequence as the said polypeptides, or having a variant of 

is the amino acid sequence of the polypeptides with a deletion, addition or substitution of 1 to 10 amino acids. The salts 
of the peptides and proteins are also within the scope of the invention. 

[0025] A process lor preparing the proteins which comprises cultivating the transformants to form the proteins. 
[0026] A method of screening for modulators in well known assays using constructs such as FADS1, FADS2, and 
FADS3 promoter luciferase or green fluorescent protein hybrids or screening for interacting proteins or factors using 
20 state of the art technologies like the interaction trap technology to screen for interacting substances of FADS1 , FADS2, 
and FADS3 or isolated domains of FADS1, FADS 2, and FADS3. 

A method of screening chemical libraries comprising transformed cell lines 

25 [0027] A compound which alters 1 reacts with at least one epitope of the proteins and which is obtained by screening 
methods utilizing the FADS1, FADS2, and FADS3 cDNAs or protein molecules. 

[0028] Use of antibodies against the FADS1 , FADS2, and FADS3 proteins for diagnostic or therapeutic purposes. 

[0029] A pharmaceutical composition comprising as an effective component of the proteins or a partial peptide of 

the proteins, and a pharmaceutical^ acceptable carrier or diluent. 
30 [0030] The term "knock-out animal" as used herein is intended to describe an animal containing a gene which has 

been modified by homologous recombination. The homologous recombination event may completely disrupt the gene 

such that a functional gene product can no longer be produced (hence the name "knock-out") or the homologous 

recombination event may modify the gene such that an altered, although still functional, gene product is produced. 

[0031] The term "knock-in" as used herein is intended to describe a variation of gene targeting that uses homologous 
35 recombination but allows expression of added genetic sequences in place of the endogenous gene. This approach 

allows the test of more subtle mutations than is allowed by a simple knock-out. 

[0032] The term "epitope" describes a region on a macromolecule which is recognized by an antibody. Frequently it 
is in a short region of primary sequence in a protein and it is generally about 5 to 12 amino acids long (the size of the 
antigen binding site on an antibody). Carbohydrates, nucleic acids and other macromolecules may be antigens and 
40 have epitopes. 

Detailed Description of the Invention 

Materials and Methods 

45 

[0033] Isolation of the FADS1 and FADS2 cDN As cDNA fragments corresponding to FADS1 and FADS2 were iden- 
tified by direct cDNA selection. The cDNA selection was performed essentially as described (Rommens et al. 1993) 
with only minor modifications. Briefly, total RNA was prepared from human retina and from established human retinal 
pigment epithelium cell line ARPE-1 9 (Dunn et al. 1 996). Prior to the use as templates for cDNA synthesis the isolated 
so RNAs were separated on a 1 .2% agarose gel in the presence of 3-(/V-morpholino)propanesulfonic acid (MOPS) and 
formaldehyde to check their integrity (Sambrook etai, 1989). 

[0034] RNAs were reverse transcribed using the SUPERSCRIPT™ preamplification system for first strand cDNA 
synthesis (Gibco, BRL) and the RXGT 12 oligonucleotide primer (5'-CGG AAT TCT CGA GAT CTT TTT TTT TTT TT- 
3'). After poly(A)-tailing with terminal transferase (United States Biochemical, USB), a cDNA pool was generated by 
55 RXGT 12 -primed PCR at 94°C for 1 min; 2 cycles of 94°C, 30 sec; 37°C, 1 min, 72°C, 2 min followed by 22 cycles of 
94°C, 30 sec; 58°C, 30 sec and 72°C, 2 min. Prior to hybridization the cDNA pools were pre-annealed to C C M DNA 
(Gibco, BRL) enriched with sonicated LINE1 sequences. 

[0035] Genomic PAC clones for cDNA selection were derived from 11 q12-q1 3.1 , a region known to contain the gene 
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underlying Best's vitellifornn macular dystrophy (Stohr et al. 1998). The assembly and orientation of the clones have 
been described previously (Cooper et al. 1997). Inserts from PAC clones dJ465G21 and dJ139E20 (—1 ug) were 
isolaled by Notl digestion, purified using OIAEXM agarose gel extraction beads (Qiagen) and immobilized on Hybond- 
N+ membrane filters with an average concentration of 60 ng/mm 2 . The insert filters were subjected to two consecutive 

5 rounds of hybridization with a starting mixture of 20 u,g of retina and ARPE-1 9 derived cDNAs. Hybridization time was 
four days at 58°C in Church hybridization buffer (Church and Gilbert 1984). Filters were washed three times in 2 x 
SSC/0.1% SDS at room temperature, once each in 0.5 x SSC/0.1% SDS, 0.2 x SSC/0.1% SDS and 0.2 x SSC/0.05% 
SDS (all at 58°C). A final wash was in 2 x SSC. cDNAs were eluted in distilled HgO by incubating for 10 min at 98°C 
and reamplified by PCR using the RXGT 12 oligonudeotide primer. Four jig of the reamplified cDNAs were used for a 

io second round of hybridization. After two rounds of selection the cDNAs were amplified using the RXGT 12 oligonucle- 
otide primer, digested with EcoRI and cloned into the EcoRI site of pBluescript (Stratagene). 

[0036] The selected cDNAs represent segments of the 3'-untranslated region (3'-UTR) of FADS1 (clone IVC4 at 
FADS1 nucleotide position 3793-4204; clone IVB7 at nucleotide position 3132-3609; done VHC6 at nucleotide position 
2077-231 7) (Fig. 3) and of the 3' UTR/coding sequence of FADS2 (done I VB8 at FADS2 nucleotide position 2626-3009; 

is clone TUK8-4B at nucleotide position 753-1 508) (Fig. 4). 

[0037] Using the selected clone sequences extensive dbEST database searches were conducted and revealed a 
large number of additional overlapping expressed sequence tags (ESTs). More than 100 ESTs (e.g. zk09h08, 
EST177650, yb28c03, ym29b05 : yx67h05) were assembled to an overlapping EST contig representing FADS1. The 
assembled EST sequences contain an open reading frame (ORF) of 1410 bp : with a first potential in-frame translation 

20 initiation codon, ATG, starting 79 nucleotides downstream the most 5'end of EST clone zk09h08.r1 (GenBank acc. no. 
AA029030) (Fig. 1a). A consensus polyadenylation signal, AAUAAA, was identified at nucleotide position 4.182. The 
mature protein predicted from the ORF consists of 444 amino acid residues resulting in a calculated molecular mass 
of 52.0 kDa (Fig. 6). 

[0038] Another 30 overlapping ESTs (e.g. cp2485.seq, HSC2EA121, EST06759, ym42c04, nc08c05) were found 
2S facilitating the assembly of the FADS2 cDNA. The assembled EST sequences contain an open reading frame (ORF) 
of 1 352 bp, with a first potential in-frame translation initiation codon, ATG, starling 21 nucleotides downstream the most 
5'end of EST done ub64e01.r1 (GenBank acc. no. AI036465) (Fig. 4). Consensus polyadenylation signals were pre- 
dicted at nucleotide positions 2.996 and 4.056. The mature FADS2 protein predicted from the ORF consists of 444 
amino acid residues resulting in a calculated molecular mass of 52.3 kDa (Fig. 7). Amino acid sequence identity between 
30 FADS1 and FADS2 is 62%. 

Isolation of the FADS3 cDNA 

[0039] Additional 30 human EST clones were available to assemble a third individual cDNA, termed FADS3 (e.g. 

35 zs84e06, zs84e05, nq23f05, ya49al9, zs86h09). The existence of a third member of the FADS family was confirmed 
by PCR mapping of FADS1-, FADS2- and FADS3-specific 3'-UTR fragments revealing three distinct gene loci within 
a 1.4 Mb PAC contig in 11q12-q13.1 (Cooper et al., 1997). The assembled EST sequences contain an open reading 
frame (ORF) of 1468 bp, with a first potential in-frame translation initiation codon, ATG, starling 134 nucleotides down- 
stream the most 5'end of EST clone qa99d06.s1 (GenBank acc. no. AM 23992) (Fig. 5). The mature protein predicted 

40 from the ORF consists of 445 amino acid residues resulting in a calculated molecular mass of 51.2 kDa (Fig. 8). The 
3'-UTR of the FADS3 cDNA is represented by several EST clones (e.g. zs86h09.s1 , AA279632). A potential polyade- 
nylation signal, AUUAAA, is present at cDNA nucleotide position 1 .757 and may be functional as AUUAAA is the most 
common natural variant of the consensus polyadenylation signal AAUAAA (Fig. 5) (Sheets et al, 1990). 
[0040] Amino acid sequence identities between FADS1 and FADS3 as well as between FADS2 and FADS3 are 52% 

4S and 63%, respectively. All EST sequences in the dbEST databases could be aligned to one of the three cDNAs : FADS1 , 
FADS2, and FADS3. This suggests that there are no additional members of the FADS family in the human genome. 

Northern blot analysis 

so [0041] Northern blot analysis was performed either with total RNA isolated using the guanidinium thiocyanate method 
(Chomczynski and Sacchi 1987) or with commercially available multiple tissue Northern (MTN) blots purchased from 
Clontech Laboratories Inc. (Palo Atto, CA). Each lane of the total RNA blot contained 12 u^g of total RNA from lung, 
cerebellum, uterus, retina, liver, heart, RPE cell line ARPE-1 9, RPE tissue, lymphocytes and was electrophoretically 
separated in the presence of formaldehyde. The MTN blots were prepared from poly(A) + RNA isolated from human 

ss heart, brain, placenta, lung, liver skeletal muscle, kidney and pancreas. Inserts of clones IVC4, IVB7 (FADS1), IVB8 
(FADS2) and of the 362 bp PCR product F3/R (^-ACAGCTTTCCCCCAATTCTC-S'/S'-GGCCTCAGCTACGAAGT- 
GAAG-3') (FAD S3) derived from the 3'-UTRs of the respective genes were used for filter hybridization at 65°C in 0.5 
mM sodium phosphate buffer, pH 7.2; 7% SDS, 1 mM EDTA at 65°C (Church and Gilbert 1984). 
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[0042] The three genes are ubiquitiously expressed and appear to have similar expression levels in all tissues an- 
alyzed. FADS1 revealed a transcript size of A 0 kb while FADS2 revealed a similar sized transcript of 4.0 kb in addition 
to a smaller transcript of approximately 3.1 kb. The two FADS2 variants may be due to differential usage of polyade- 
nylation signals (see above). Finally, FADS3 is represented by two transcripts of 1 .75 kb and 1 .25 kb in size. While the 
s former is in agreement with the usage of the variant polyadenylation signal identified at position 1738 of the cDNA, the 
small size of the latter transcript can not be explained at present and it does not appear to be due to a differential usage 
of polyadenylation signals. Possibly, differential splicing and/or exon skipping may be involved in the generation of the 
variant transcript. However, there is no evidence from cDNA cloning or EST contig assembly to support this possibility. 

10 Comparison with other desaturases 

[0043] Local sequence alignments of the deduced amino acid sequences of FADS1 , FADS2, and FADS3 with known 
proteins or protein motifs were done using SwissProt (http://www.ncbi. nlm.nih.gov/cghbin/Blast/nph-blast?Jform=0) 
and the BLASTP and BEAUTY programs at Baylor College of Medicine (http://dot.imgen.bcm.tmc.edu: 9331 /seq- 
15 search/protein-search.html). Amino acid sequence alignments were performed using the CLUSTALW multiple align- 
ment program at http://pbil.ibcp.fr/NPSA/npsa_clustalw.htmL Phylogenetic tree assembly was done using the 
TREECON software Version 1.3b available at http://bioc-www.uia.ac.be/u/yvdp/index.html. 

[0044] Overall amino acid identities to known desaturases were found to be in the range of 22% - 27% (Fig. 1). 
Phylogenetic tree construction revealed a genetic relationship of FADS1, FADS2, and FADS3 to the A5-, A6- and 

20 A8-desaturases with some distance to the A9-desaturases (Fig. 2). From these analyses it becomes obvious that 
sequence identity by itself is not a predictor of a specific desaturase activity. For example, A5- and A6-desatu rases 
from C. elegans demonstrate a higher sequence identity to each other than to the A6-desaturases from other species. 
We therefore conclude that based on simple sequence comparisons it is not feasible to determine the specific functions 
of FADS1, FADS2, and FADS3. This will be done by transgene expression of the three desaturases combined with 

25 gas chromatograpy-mass spectometry. 

[0045] Hydropathy plots of FADS1 , FADS2 and FADS3 indicate two hydrophiobic sequences predicted to represent 
transmembrane-spanning domains similar to other desaturases identified thus far (Fig. 1) (reviewed in Sperling et al. 
1995). 

30 cDNA amplification of FADS1 , FADS2, and FADS3 

[0046] The coding sequences of the three genes are amplified in overlapping fragments by performing FTT-PCR using 
oligonucleotide primer pairs derived from the respective cDNA sequences: 

35 (i) FADS1 (Fig. 9 and SEQ ID NOS. 7-12) 

[0047] Sense primer TU12-R5 (5'-CGCCTGACAGCCCCTGCT-3*) at cDNA position 31-48 in combination with anti- 
sense primer TU12-F10 (5'-CAGGTGGCCAATCACAAAAT-3') at cDNA position 671-690 results in a product of 660 
bp; sense primer TU12-R7 (5'-CTC AAAGTG G AAC C ATCTG CTA-3') at cDNA position 645-666 in combination with 
40 antisense primer TU12-F9 (5'-GGAAACCCAGTCCATGTTCC-3') at cDNA position 1130-1149 results in a product of 
505 bp; sense primer TU12-R6 (S'-CCTGGG CCTTTTCTTCATAGT-3') at cDNA position 1035-1055 in combination with 
antisense primer TU12-F5 (5'-CTCAAGCTCCCCTCTGCCT-3') at cDNA position 1465-1483 results in a product of 449 
bp. 

45 (2) FADS2 (Fig. 9 and SEQ ID NOS. 1 3-18) 

[0048] Sense primer TU13-R4 (5'-TCAGAAGCATAACCTGCGC-3') at cDNA position 98-116 in combination with 
antisense primer TU13-F7 (5'-CC AGTTCACC AATC AGCAGG-3') at cDN A position 284-303 results in a product of 206 
bp; sense primer TU13-R3 (5'-CCCCTGCTGATTGGTGAACT-3') at cDNA position 282-301 in combination with anti- 
50 sense primer TU13-F4 (5'-TGTAGGGCAGGTATTTCAGC -3') at cDNA position 779-798 results in a product of 517 bp; 
sense primer TU13-R2 (5'-AGCCCATCGAGTACGGCAA-3') at cDNA position 754-772 in combination with antisense 
primer TU13-F1 (5'-CCTCAGAACAAAAGCCCATC-3') at cDNA position 1416-1435 results in a product of 682 bp. 

(3) FADS3 (Fig. 9 and SEQ ID NOS. 19-22) 

ss 

[0049] Sense primer TU1 9-R2 (5'-TCTTGCTCGGACCTCGGC-3') at LLCDL3 cDNA position 81 -98 in combination 
with antisense primer TU19-F2 (S'-GTGATCCACACGAACCAGTG-S') at cDNA position 1130-1149 position results in 
a product of 1069 bp; sens primer TU19-R3 (5'-GAAGAACCCAGCCAGGATG-3") at cDNA position 428-446 in com- 
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bination with antisense primer TU19-F3 (5'-ACAGCTTTCCCCCAATTCTC-3') at cDNA position 1709-1728 results in 
a product of 1 301 bp. 

Short description of Figur s 

5 

[0050] 

Fig. 1 Comparison of putative amino acid sequences from FADS1 , FADS2, FADS3, Borago officinalis, Helianthus 
annuus and human cytochrome b5. Arrowheads indicate eight invariant amino acid residues typical for the cyto- 
io chrome b5 domain. Two potential transmembrane domains are boxed. Three histidine motifs HX 2(3) [XH]H that are 

conserved within the desaturase family are hatched. 
Fig. 2 Phylogenetic tree of fatly acid desaturases. 

Fig. 3 (SEQ ID NO. 1 ) shows the nucleotide sequence of the FAD SI cDNA 
Fig. 4 (SEQ ID NO. 2) shows the nucleotide sequence of the FADS 2 cDNA 

1S Fig. 5 (SEQ ID NO. 3) shows the nucleotide sequence of the FADS33 cDNA 

Fig. 6 (SEQ ID NO. 4) shows the putative amino acid sequence of the predicted FADS1 protein 
Fig. 7 (SEQ ID NO. 5) shows the putative amino acid sequence of the predicted FADS2 protein 
Fig. 8 (SEQ ID NO. 6) shows the putative amino acid sequence of the predicted FADS3 protein 
Fig. 9 (SEQ ID NOS. 7-22) shows the oligonucleotide PCR primers utilized to amplify the FADS1, FADS2, FADS3 

20 cDNA, respectively 
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Annex to the application documents - subsquentlv filed sequences listing 



[0052] 



SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

10 (i) APPLICANT: 

(A) NAME: MultiGen Biotech GmbH 

( B) STREET: Am Hubland 

(C) CITY: Wuerzburg 

(D) STATE: - 

(E) COUNTRY: Germany 

(F) POSTAL CODE (ZIP): 970*74 

(G) TELEPHONE: 0931-7058-4340 

(H) TELEFAX: 0931-7058-4 3 55 

(I) TELEX: - 



(ii) TITLE OF INVENTION: cDNA molecules of the members of a gene family 
encoding human fatty acid desaturases and their use in 
20 diagnosis and therapy 



(iii) NUMBER OF SEQUENCES: 22 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

{ B ) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version 81.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 4 amino acids 
{ B ) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/ KEY : Protein 

(B) LOCATION :1. .444 

40 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4204 base pairs 

(B) TYPE: nucleic acid 
<C: STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 
50 (A) NAME /KEY: exon 

(B) LOCATION : 1 . .4204 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CACTCCTGGA GCCCGCGGAC CCCGAGCACG CGCCTGACAG CCCCTGCTGG CCCGGCGCGC 60 

GGCGTCGCCA GGCCAGCTAT GGCCCCCGAC CCGGTGGCCG CCGAGACCGC 
GGCTCAGGGA 120 

CCTACCCCGC GCTACTTCAC CTGGGAC GAG GTGGCCCAGC GCTCAGGGTG 
CGAGGAGCGG 180 

TGGCTAGTGA TCGACCGTAA GGTGTACAAC ATCAG CGAGT TCACCCGCCG 
GCATC CAGGG 24 0 

GGCTCCCGGG TCATCAGCCA CTACGCCGGG CAGGATGCCA CGGATCCCTT 
TGTGGCCTTC 300 

CACATCAACA AGGGCCTTGT GAAGAA GT AT ATGAACTCTC TCCTGATTGG 
AGAACTGTCT 3 60 

CCAGAGCAGC CCAGCTTTGA GCCCACCAAG AATAAAGAGC TGACAGATGA 
GTTCCGGGAG 4 20 

CTGCGGGCCA CAGTGGAGCG GATGGGGCTC ATGAAGGCCA ACCATGTCTT 
CTTCCTGCTG 4 80 

TACCTGCTGC ACATCTTGCT GCTGGATGGT GCAGCCTGGC TCACCCTTTG 
GGTCTTTGGG 54 0 

ACGTCCTTTT TGCCCTTCCT CCTCTGTGCG GTGCTGCTCA GTGCAGTTCA 
GGCCCAGGCT 600 

GGCTGGCTGC AGCATGACTT TGGGCACCTG TCGGTCTTCA GCACCTCAAA 
GTGGAACCAT 660 

CTGCTACATC ATT T T GT GAT TGGCCACCTG AAGGGGGCCC CCGCCAGTTG 
GTGGAACCAC 720 

ATGCACTTCC AGCACCATGC CAAGCCCAAC TGCTTCCGCA AAGACCCAGA 
CATCAACATG 7 80 

CATCCCTTCT TCTTTGCCTT GGGGAAGATC CTCTCTGTGG AGCTT GGGAA 
ACAGAAGAAA 8 4 0 

AAATATATGC CGTACAACCA CCAGCACAAA TACTTCTTCC TAAT^GGGCC 
CCCAGCCTTG 900 

CTGCCTCTCT ACTTCCAGTG GTATATTTTC TATTTTGTTA TCCAGCGAAA 
GAAGTGGGTG 960 

GACTTGGCCT GGATGATTAC CTTCTACGTC CGCTTCTTCC TCACTTATGT 
4S GCCACTATTG 1 020 

GGGCT GAAAG CCTTCCTGGG CCTTTTCTTC ATAGT CAGGT TCCTGGAAAG 
CAACTGGTTT 1080 

GTGTGGGTGA CACAGATGAA CCATATTCCC ATGCACATTG AT CAT GACCG 
SO GAACAT GGAC 114 0 

TGGGTTTCCA CCCAGCTCCA GGCCACATGC AATGTCCACA AGTCT GCCTT 
CAATGACTGG 1200 

TTCAGT GGAC ACCT CAACTT CCAGATTGAG CACCATCTTT TTCCCACGAT 
SS GCCTCGACAC 1260 
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AATTACCACA AAGTGGCTCC CCTGGTGCAG TCCTTGTGTG CCAAGCATGG 
CATAGAGTAC 1320 

CAGTCCAAGC CCCTGCTGTC AGCCTTCGCC GACATCATCC ACT C ACT AAA 
GGAGTCAGGG 1380 

CAGCTCTGGC TAGATGCCTA TCTTCACCAA TAACAACAGC CACCCTGCCC 
AGTCTGGAAG 1440 

AAGAGGAGGA AGACTCTGGA GCCAAGGCAG AGGGGAGCTT GAGGGACAAT 
G C CAC TAT AG 1500 

TTTAATACTC AGAGGGGGTT GGGTTTGGGG ACATAAAGCC TCTGACTCAA 
ACTCCTCCCT 1560 

TTTATCTTCT AGCCACAGTT CTAAGACCCA AAGTGGGGGG TGGACACAGA 
AGTCCCTAGG 1620 

AGGGAAGGAG CTGTTGGGGC AGGGGTGTAA ATTATTTCCT TTTTCTAGTT 
TGGCACATGC 1680 

AGGTAGTTGG TGAACAGAGA GAACCAGGAG GGTAACAGAA GAGGAGGGAC 
CTACTGAACC 174 0 

CAGAGTCAGG AAGAGATTTA AC AC T AAAAT TCCACTCATG CCGGGCGTGG 
TGCACGCGCC 1800 

TGTAATCCCA GCTACCCAGG AGGCTGAGGC AGGAGAATCG CTTGAACCGG 
GGAGGTGGAG I860 

GTTGCAGTGA GCTGAGATCA CGCCATTGTA CTCCAGCCTG GGCGACAGAG 
CAAGACTCCA 1920 

TTTCAAAAAA AAAAAAAAAA AAAAAAATCC ACT CAT AT AA AAGGTGAGCT 
CAGCTCACTG 198 0 

GTCCATTTCT CAGTGGCTTC TCCATCCTCA TTTGCAAACC TCAGAGGGAT 
AAGGCAGTTG 204 0 

AACCTGATGA GCAAGAATTA TAACAGCAAG GAAACATTAA TGCTTAGAAT 
TCTGAGATCC 2100 

AGCACAACTC AGTCTGTGGG AGCTCAGCTC GCTGCCCAGG GATAGGT AT G 
ACCTATGTCT 2160 

GCCTTAGGCT GCTGGGAGAT GCCATTCTCC AGTTTCAGAA GCAGGCAGGG 
CAAAGGTCAA 222 0 

GACTGTGGTA TTGGGGTCTT TTGGCTCTGA AGGATCCTGG AACCACTGAT 
TTTGGTTTAT 22 80 

TCCCTCCAGG GTCTAAAGAG AACAAGAGGT GCTAGCTCTT AC CAAAAC AG 
ATGGTAGAGA 2 34 0 

GAGTTGCTGG CTATTTAAAA AGCTCTTTCA TCTTTTAATT CACCTCTTCT 
TTTCACCTCT 24 00 

TTAACCACTC CTCAGGAACA GAACACTTCT AGGACTGGGG GTCTTTTAGC 
TCCATAAGCA 24 60 

AGTGAGCAGA TGGGACAAGT TAGTCTTTTC TCCCTAGAAA CAAAGGGGAT 
GCCCAGTGGT 2 52 0 
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TTCCCTTTGC TTCCCAACCT AAAATTTCAA GTTTAATAAA ATAGCAATTA 
GCAGAAGTGA 2 580 

CCAAATTGGG AGATAATTAT CAGTCATGAG GAAAGACACA GATTTCGGTC 
ATAAAGAATG 2 64 0 

T AAGGGC TAT AAGTAGAAAC TTTCTATAAC CTAAATGATG TTATAGAATT 
ATTTTTGAGC 2 7 00 

AGGAGCAGAA AGATTAAATA TGATCACTTC ATACTTCTAA AT CAGAAATA 
GGAAGATTAA 27 60 

AACCACAGAA CAGTTTGTGA TTTCTATTGC TGGTAGCTAG G TAT CT TACT 
CTGTCCACTC 2 820 

TTGTTCAAGT ATCTAACTCT TCTGGAAACC AAATAGGCTT TAGAAGAGAT 
TAT CCT AT AT 2 8 80 

TCCTATCAGT ATAATACTAA AATGTAACTT TTTAATCATC TGGTTTTTAA 
AAGATAAACA 2 940 

GTTTAGCCCA TCTCTCCAGA GAGCAAACAT AGGAAT AT GA CTCAGGAGCC 
TCCTAGGGCT 3 000 

TAT CATCAG C CCTCACACCC GCTTCCCCCT CCAACCCACA GCCTTTGCTT 
CCAGGTGGCA 3 060 

GGATTACTAC TTTGCCTCTT CAGCAGCATC TACTCTAGGC ATATTGATCA 
TTTTAGACAC 3120 

TGGGAGAAGA GAACCTCAAA CTAGGAGGAA AAGACAGAGC C T C CACTT AG 
TTTTGGGAGG 3180 

GGATGGCAGA CAGTCAAGGA GATGAGCGTC CTAAGGCATG TTGGGATAGG 
GTCAGATGCA 324 0 

CCACCCATGG AGAGGTTTGT CAACACAAAG ACATGGAAGG TTAGAGGTTT 
GTCAACAAAA 3300 

AGACATG GAA GGTTAGGTTT GTCAACACAA AGACAT GGAA GAT TAGAGGT 
TTGTCAACAC 3 360 

AAAGATACAG GAAGAATGGG CTGCAGAAGA TTTAGATGTT TTCCATTTGG 
GCACATTTTA 3 420 

CTTAGCTGGA GAAC TAGGTT TAAAACAGCC TGGGTAGGAA AATTAGAAGC 
AAGCTGGATG 3 4 80 

CAGTGGCTCA TGCCTGTAAT CCCAACACTT TTGGGAGGTC CAGGCAGGAG 
GATCACTTGG 3 54 0 

GCCCAGGAGG TCAAGCCTGC AGCGAGCTGA GATCACACCA CTGCACTCCA 
GCCTGGGGTG 3600 

ATAGAACAAG ACCCTGTCTC AAAAAAAAAA AAAAACAACA AAAACTTAGA 
ATTGAGGAGT 3660 

TGTACCTCCA TTGGCTTCCT CACTCCAAAA TAGGTGCTGA TCCTTCCTAT 
TCCTATTCTT 3720 

TGCCACCTTT TGGGTGTGGT GTCACCAGCC TGTTTAGCCA AGTAGCTTTG 
GGCATAGGCT 37 80 
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GCCCAATCTG AGCAAACACC AGTGAGGCTC T ATT GAGCAA GACCAAGTCC 
TCAAAGCACC 384 0 

5 TGAACCACTG TGGCCTTCTC AGCCTACAGC AGTGTGGTCT CTTACATGGC 

CACAAAGGGA 3900 

CACACAGTGA CAAAAGGCTC GGAATGTTAC AATGGTAAAA TGAGTGATCT 
CAAATCCACT 3960 

10 

GACAGATATA AAATAGGCTT AGAGAGGAAA AGCTGCCTCT GGTCAAGTAG 
ATCATGGCAG 4 02 0 

CATGAATTCC AACTCACTTT TTTACGAACT CCAACTTCTA TGTTTATCTT 
75 TGTTACTTTC 4 080 

ACTTTTTTAC AACCTGCAGA GGCATTTTTT AAATGAGGCC CAATATCAGT 
ATTCTTTTTG 414 0 

20 TGTGTGCCAA TTTTGTTATC ACATCCCTAT GAAGTTGAAA AATAAAGTTA 

ATTTTGACCA 4200 

AAAG 4204 



25 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4089 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 1. .4089 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CGTCACAGTC GGCAGGCAGC ATGGGGAAGG GAGGGAACCA GGGCGAGGGG 
GCCGCCGAGC 60 

GCGAGGTGTC GGTGCCCACC TTCAGCTGGG AGGAGATTCA GAAGCATAAC 
CTGCGCACCG 12 0 

ACAGGTGGCT GGTCATTGAC CGCAAGGTTT ACAACATCAC CAAATGGTCC 
ATCCAGCACC 180 

CGGGGGGCCA GCGGGT CATC GGGCACTACG CTGGAGAAGA TGCAACGGAT 
GCCTTCCGCG 240 

CCTTCCACCC TGACCTGGAA TTCGTGGGCA AGTTCTTGAA ACCCCTGCTG 
ATTGGTGAAC 300 

TGGCCCCGGA GGAGCC CAGC CAGGACCACG GCAAGAACTC AAAGAT CACT 
GAGGACTTCC 360 

GGGCCCTGAG GAAGACGGCT GAGGACATGA ACCTGTTCAA GACCAACCAC 
GTGTTCTTCC 420 

TCCTCCTCCT GGCCCACATC ATCGCCCTGG AGAGCATTGC AT G GT T CACT 
GTCTTTTACT 4 80 

TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 
TCTCAGGCCC 540 

AAGCTGGATG GCT GCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 
CCCAAGTGGA 600 

ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 
AACTGGTGGA 660 

ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT C CACAAG GAT 
CCCGATGTGA 72 0 

ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GG CAGC C CAT CGAGT AC GGC 
AAGAAGAAGC 7 8 0 

TGAAATACCT GCCCTACAAT CACCAGCACG AATACTTCTT CCTGATTGGG 
CCGCCGCTGC 84 0 

TCATCCCCAT GTATTTCCAG T AC CAGATCA T CAT GAC CAT GAT CGT C CAT 
AAGAACT GGG 900 
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TGGACCTGGC CTGGGCCGTC AGCTACTACA TCCGGTTCTT CATCACCTAC 
ATCCCTTTCT 960 

ACGGCATCCT GGGAGCCCTC CTTTTCCTCA ACTTCATCAG GTTCCTGGAG 
AGCCACTGGT 1020 

TTGTGTGGGT CACACAGATG AATCACATCG TCATGGAGAT TGACCAGGAG 
GCCTACCGTG 1080 

ACTGGTTCAG TAGCCAGCTG ACAGCCACCT GCAACGTGGA GCAGTCCTTC 
TTCAACGACT 114 0 

GGTTCAGTGG ACACCTTAAC TTCCAGATTG AGCACCACCT CTTCCCCACC 
ATGCCCCGGC 1200 

ACAACTTACA CAAGATCGCC CCGCTGGTGA AGTCTCTATG TGCCAAGCAT 
GGCATTGAAT 1260 

ACCAGGAGAA GCCGCTACTG AGGGCCCTGC TGGACATCAT CAGGTCCCTG 
AAGAAGTCTG 1320 

GGAAGCTGTG GCTGGACGCC TACCTTCACA AATGAAGCCA CAGCCCCCGG 
GACACCGTGG 1380 

GGAAGGGGTG CAGGTGGGGT GATGGCCAGA GGAATGATGG GCTTTTGTTC 
TGAGGGGTGT 144 0 

CCGAGAGGCT GGTGTATGCA CTGCTCACGG AC CCCATGTT GGATCTTTCT 
CCCTTTCTCC 1500 

TCTCCTTTTT CTCTTCACAT CTCCCCCATA GCACCCTGCC CTCATGGGAC 
CTGCCCTCCC 1560 

TCAGCCGTCA GCCATCAGCC ATGGCCCTCC CAGTGCCTCC TAGCCCCTTC 
TTCCAAGGAG 1620 

CAGAGAGGTG GCCACCGGGG GTGGCTCTGT CCTACCTCCA CTCTCTGCCC 
CTAAAGATGG 168 0 

GAGGAGACCA GCGGTCCATG GGTCTGGCCT GTGAGTCTCC CCTTGCAGCC 
TGGTCACTAG 174 0 

GCATCACCCC CGCTTTGGTT CTTCAGATGC TCTTGGGGTT CATAGGGGCA 
GGTCCTAGTC 1800 

GGGCAGGGCC CCTGACCCTC CCGGCCTGGC TTCACTCTCC CTGACGGCTG 
CCATTGGTCC 1860 

AC C CTTT CAT AGAGAGGCCT GCTTTGTTAC AAAGCTCGGG TCTCCCTCCT 
GCAGCTCGGT 1920 

TAAGTACCCG AGGCCTCTCT TAAGATGTCC AGGGCCCCAG GCCCGCGGGC 
ACAGCCAGCC 1980 

CAAACCTTGG GCCCTGGAAG AGTCCTCCAC CCCATCACTA GAGTGCTCTG 
ACCCTGGGCT 204 0 

TTCACGGGCC CCATTCCACC GCCTCCCCAA CTTGAGCCTG TGACCTTGGG 
AC CAAAG GGG 2100 

GAGTCCCTCG TCTCTTGTGA CTCAGCAGAG GCAGTGGCCA CGTTCAGGGA 
GGGGCCGGCT 2160 
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GGCCTGGAGG CTCAGCCCAC CCTCCAGCTT TTCCTCAGGG TGTCCTGAGG 
TCCAAGATTC 2220 

TGGAGCAATC TGACCCTTCT CCAAAGGCTC TGTTATCAGC TGGGCAGTGC 
CAGCCAATCC 2280 

CTGGCCATTT GGCCCCAGGG GACGTGGGCC CTGCAGGCTG CAGGAGGGCA 
CTGGAGCTGG 234 0 

GAGGTCTCGT CCCAGCCCTC CCCATCTCGG GGCTGCTGTG TGGACGGCGC 
TGCCTCAGGC 2400 

ACTCTCCTGT CTGAACCTGC CCTTACTGTG TTTAACCTGT TGCTCCAGGA 
TGCATTCTGA 24 60 

TAGGAGGGGG CGGCAGGGCT GGGCCTTGTG ACAATCTGCC TTTCACCACA 
TGGCCTTGCC 2 520 

TCGGTGGCCC TGACTGTCAG GGAGGGCCAG GGAGGCAGAG CGGGAGGGAG 
TCTCAGGAGG 2580 

AGGCTGCCCT GAGGGGCTGG GGAGGGGGTA CCTCATGAGG AC CAGGGT GG 
AGCTGAGAAG 2 64 0 

AGGAGGAGGT GGGGGCTGGA GGTGCTGGTA GCTGAGGGGA CGGGCAAGTG 
AGAGGGGAGG 27 00 

GAGGGAAGTC CTGGGAGGAT CCTGAGCTGC TGTTGCAGTC TAACCCACTA 
ATCAGTTCTT 2760 

AGATTCAGGG GAAGGGCAGG CACCAACAAC TCAGAATGGG GGCTTTCGGG 
GAGGGCGCCT 2 620 

AGTCCCCCCA GCTCTAAGCA GCCAGGAGGG ACCTGCATCT AAGCATCTGG 
GTTGCCATGG 2880 

CAATGGCATG CCCCCCAGCT ACTGTATGCC CCCGACCCCC GCAGAGGCAG 
AATGAACCCA 294 0 

TAGGGAGCTG ATCGTAATGT TTATCATGTT ACTTCCCCAC CCCTACATTT 
TTTGAAATAA 3000 

AAT AAG GAAT TTTATTCTCA CTTCCTGTGT TTCCTGCACG CCAATGCCAG 
GCCATGGTAT 3060 

TGGGTGATAG ATGAGGCCCT TCTAGCTGGG CCTGGGCACC AGGAGGGGTC 
CCCATGCTTG 3120 

CATCTCTCTG TATCCCCTCC CTCCCCTGTG GCCATCCCAC CCGCCTCTCC 
CTGCTGCCTC 3180 

TGAAATTCAT TCTGGGGCCC GGAACTTGGT GGAAATGACC CAAAAACATT 
GGCCCATCTT 324 0 

CCTCCTCTCA GCAGCCGACC CCAGCCCAAT TCTAAAACAG GGCTGAGAGC 
CACCTCTCAG 3300 

CAGCTGACCC CTACCCAAGG AGGGTGGCAT GGAG GGGCTT GCAGAGACTC 
TTCCTAACAT 3360 

CCTCCCCCCC CAGCTGTCTC CCCAAGTGCA ATCTGCCCTC CCATCCCTGG 
GCCAGCCAGC 3420 
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TTCCACAGAG CGCCAGGCCA AACAGAATTC CTGGCCTCCT TGGAAGGGGC 
TGGAGAAGGC 34 80 

CGGGAGCAGT GGCTCACGCC TGTAATCCCA GCACTTTGGG AGGCTGAGGC 
GGGCAGATCA 354 0 

CAAAGTCAAG AGATT GAGAC CATCCTGGCC AACATGGTGA AACCCCGTCT 
CTACTAAAAA 3600 

TACAAAAATT AGGCCGGGTG CGGTGGCTCA CGCCTGTAAT CCCAGCACTT 
TGGGAGGCCG 3660 

AGGCGGGCAG ATCACGAGGT CAGGAGATCA AGACCATCCT GGCTAACACG 
GTGAAACCCC 3720 

GTCTCTACTA AAAATACAAA AAATTAGCTG GGCGAGGTGG CGGGTGCCTG 
TAGTCCCAGC 37 80 

TACGTGGGAG GCTGAGGCAA GAGAATGGCG TGAACCCCGG CGGGGCAGAG 
20 CCTGCAGAGA 384 0 

GCTGAGATCA CACCACTGTA CTCCAGCCTG GGCGACAGCG AGACTCCGTC 
TCAAAAAAAA 3900 

25 AAAAAAAAAA AATTAGCTGG GCATGGTGGT GCGTGCCTGC AGTCCCAGCT 

ACTCAGGAGG 3960 

CTGAGACGGG AGAATCGCTT GAACCTGGGA GGCAGAGGTT GCAGTGAGCC 
AAGATCGCTC 4 020 
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ACTCCAGCCT AGC GACAGAG TGAGACTCCA TCTCAAATAA ATAAATAAAT 
TAATTAATTA 4 080 

AATTAAATT 4 089 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1757 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



<ix> FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION : 1 . .1757 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGCCGCGGCG GCAGGGCGGG GCCGGAGCAG CGGGCGGCGG CGGAGGCGGC 
GCCCGGGAGC 60 

GCTCTTCGCT TCCCTCGGGG TCTTGCTCGG ACCTCGGCCA CCGCCTGGGA 
TCCCCAGGAC 120 

TCGTGCGTGC AGCATGGGCG GCGTCGGGGA GCCGGGACCG CGGGAGGGAC 
CCGCGCAGCC 180 

GGGGGCACCG CTGCCCACCT TCTGCTGGGA GCAGATCCGC GCGCACGACC 
AGCCCGGCGA 240 

CAAGTGGCTG GTCATCGAGC GCCGCGTCTA CGACATCAGC CGCTGGGCAC 
AGCGGCACCC 300 

AGGGGGCAGC CGCCTCATCG GCCACCACGG CGCTGAGGAC GCCACGGATG 
CCTTCCGTGC 360 

CTTCCATCAA GATCTCAATT TTGTGCGCAA GTTCCTACAG CCCCTGTTGA 
TTGGAGAGCT 420 

GGCTCCGGAA GAACCCAGCC AGGATGGACC CCTGAATGCG CAGCTGGTCG 
AGGACTTCCG 4 80 

AGCCCTGCAC CAGGCAGCCG AGGACATGAA GCTGTTTGAT GCCAGTCCCA 
CCTTCTTTGC 54 0 

TTTCCTACTG GGCCACATCC TGGCCATGGA GGTGCTGGCC TGGCTCCTTA 
TCTACCTCCT 600 

GGGTCCTGGC TGGGTGCCCA GTGCCCTGGC CGCCTTCATC CTGGCCATCT 
CTCAGGCTCA 660 

GTCCTGGTGT CTGCAGCATG ACCTGGGCCA TGCCTCCATC TTCAAGAAGT 
CCTGGTGGAA 720 

CCACGTGGCC CAGAAGTTCG T GAT GGGGCA GCTAAAGGGC TTCTCCGCCC 
ACTGGTGGAA 780 

CTTCCGCCAC TTCCAGCACC ACGCCAAGCC CAACATCTTC CACAAAGACC 
CAGACGTGAC 840 

GGTGGCGCCC GTCTTCCTCC TGGGGGAGTC ATCCGTCGAG TATGGCAAGA 
AGAAACGCAG 900 
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ATACCTACCC TACAACCAGC AGCACCTGTA CTTCTTCCTG ATCGGCCCGC 
CGCTGCTCAC 960 

5 CCTGGTGAAC TTTGAAGTGG AAAATCTGGC GTACATGCTG GTGTGCATGC 

AGTGGGCGGA 1020 

TTTGCTCTGG GCCGCCAGCT TCTATGCCCG CTTCTTCTTA TCCTACCTCC 
CCTTCTACGG 108 0 

10 

CGTCCCTGGG GTGCTGCTCT TCTTTGTTGC TGTCAGGGTC CTGGAAAGCC 
ACTGGTTCGT 1140 

GTGGATCACA CAGATGAACC ACATCCCCAA GGAGATCGGC CACGAGAAGC 
75 ACCGGGACTG 12 00 

GGTCAGCTCT CAGCTGGCAG CCACCTGCAA CGTGGAGCCC TCACTTTTCA 
CCAACTGGTT 1260 

20 CAGCGGGCAC CTCAACTTCC AGATCGAGCA CCACCTCTTC CCCAGGATGC 

CGAGACACAA 1320 

CTACAGCCGG GTGGCCCCGC TGGTCAAGTC GCTGTGTGCC AAGCAC G GCC 
TCAGCTACGA 1380 

25 AGTGAAGCCC TTCCTCACCG CGCTGGTGGA CATCGTCAGG TCCCTGAAGA 

AGTCTGGTGA 14 40 

CATCTGGCTG GACGCCTACC TCCATCAGTG AAGGCAACAC CCAGGCGGGC 
AGAGAAGGGC 1500 

30 

TCAGGGCACC AGGAACCAAG CCAGCCCCGG CGGGATCGAT ACCCCCACCC 
CTCCACTGGC 1560 

CAGCCTGGGG GTGCCCTGCC TGCCCTCCTG GTACTGTTGT CTTCCCCTCG 
35 GCCCCCTCAC 1620 

ATGTGTATTC AGCAGCCCTA TGGCCTTGGC TCTGGGCCTG ATGGGACAGG 
GGTAGAGGGA 1680 

40 AGGT GAGCAT AGCACATTTT CCTAGAGCGA GAATTGGGGG AAAGCTGTTA 

TTTTTATATT 1740 

AAAATACATT CAGATGT 
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INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE : 

(A) NAME/ KEY: Protein 

(B) LOCATION: 1 . .444 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Pro Asp Pro Val Ala Ala Glu Thr Ala Ala Gin Gly Pro Thr 

10 15 

Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gin Arg Ser Gly Cys Glu 
20 25 30 

Glu Arg Trp Leu Val He Asp Arg Lys Val Tyr Asn He Ser Glu Phe 
35 40 45 

Thr Arg Arg His Pro Gly Gly Ser Arg Val He Ser His Tyr Ala Gly 
50 55 60 

Gin Asp Ala Thr Asp Pro Phe Val Ala Phe His He Asn Lys Gly Leu 
65 70 75 " 60 

Val Lys Lys Tyr Met Asn Ser Leu Leu He Gly Glu Leu Ser Pro Glu 
85 90 95 

Gin Pro Ser Phe Glu Pro Thr Lys Asn Lys Glu Leu Thr Asp Glu Phe 
100 105 HO 

Arg Glu Leu Arg Ala Thr Val Glu Arg Met Gly Leu Met Lys AJ a Asn 
115 120 125 

His Val Phe Phe Leu Leu Tyr Leu Leu His He Leu Leu Leu Asp Gly 
130 135 140 

Ala Ala Trp Leu Thr Leu Trp Val Phe Gly Thr Ser Phe Leu Pro Phe 
14 5 150 155 160 

Leu Leu Cys Ala Val Leu Leu Ser Ala Val Gin Ala Gin Ala Gly Trp 
lfc5 170 175 

Leu Gin His Asp Phe Gly His Leu Ser Val Phe Ser Thr Ser Lys Trp 
180 185 190 

Asn His Leu Leu His His Phe Val He Gly His Leu Lys Gly Ala Pro 
195 200 ~ 205 

Ala Ser Trp Trp Asn His Met His Phe Gin His His Ala Lys Pro Asn 
210 215 220 

Cys Phe Arg Lys Asp Pro Asp He Asn Met His Pro Phe Phe Phe Ala 
225 230 235 240 
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Leu Gly Lys He Leu Ser Val Glu Leu Gly Lys Gin Lys Lys Lys Tyr 
245 250 " 255 

Met Pro Tyr Asn His Gin His Lys Tyr Phe Phe Leu He Gly Pro Pro 

260 265 270 

Ala Leu Leu Pro Leu Tyr Phe Gin Trp Tyr He Phe Tyr Phe Val He 

275 280 285 

Gin Arg Lys Lys Trp Val Asp Leu Ala Trp Met Tie Thr Phe Tyr Val 
290 295 300 

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu 
305 310 315 320 

Gly Leu Phe Phe He Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp 

325 330 335 

Val Thr Gin Met Asn His He Pro Met His He Asp His Asp Arg Asn 
340 345 350 

Met Asp Trp Val Ser Thr Gin Leu Gin Ala Thr Cys Asn Val His Lys 

355 360 365 

Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 
370 375 380 

His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Lys Val Ala 
385 390 395 ' " 400 

Pro Leu Val Gin Ser Leu Cys Ala Lys His Gly He Glu Tyr Gin Ser 
405 410 415 

Lys Pro Leu Leu Ser Ala Phe Ala Asp He He His Ser Leu Lys Glu 
35 420 425 430 

ser Gly Gin Leu Trp Leu Asp Ala Tyr Leu His Gin 
435 ' 440 

40 



45 



50 



55 
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BNSDOCID: <EP 1035207A1_I_> 



23 



EP 1 035 207 A1 



) INFORMATION FOR SEQ ID NO: S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



{ i X ) FEATU RE : 

(A) NAME/ KEY : Protein 

(B) LOCATION : 1 . .444 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Giy Lys Gly Gly Asn Gin Gly Glu Gly Ala Ala Glu Arg Glu Val 
15 10 is 

Ser Val Pro Thr Phe Ser Trp Glu Glu He Gin Lys His Asn Leu Arg 
2 ° 25 30 

Thr Asp Arg Trp Leu Val Tie Asp Arg Lys Val Tyr Asn Tie Thr Lys 
3i > 40 45 

Trp Ser lie Gin His Pro Gly Gly Gin Arg Val He Gly His Tyr Ala 
*0 5b b0 

Gly Glu Asp Ala Thr Asp Ala Phe Arg Ala Phe His Pro Asp Leu Glu 
65 70 75 80 

Phe Val Gly Lys Phe Leu Lys Pro Leu Leu He Gly Glu Leu Ala Pro 
65 yu 9S 

Glu Glu Pro Ser Gin Asp His Gly Lys Asn Ser Lys He Thr Glu Asp 
100 105 no 

Phe Arg Ala Leu Arg Lys Thr Ala Glu Asp Met Asn Leu Phe Lys Thr 
115 120 125 

Asn His Val Phe Phe Leu Leu Leu Leu Ala His He He Ala Leu Glu 
130 135 140 

Ser He Ala Trp Phe Thr Val Phe Tyr Phe Gly Asn Gly Trp He Pro 
145 1*0 155 ^ 160 

Thr Leu He Thr Ala Phe Val Leu Ala Thr Ser Gin Ala Gin Ala Gly 
1*>S 170 175 

Trp Leu Gin His Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lvs 
180 185 190 

Trp Asn His Leu Val His Lys Phe Val He Gly His Leu Lys Gly Ala 
195 200 205 

Ser Ala Asn Trp Trp Asn His Arg His Phe Gin His His Ala Lys Pro 
210 215 220 

Asn He Phe His Lys Asp Pro Asp Val Asn Met Leu His Val Phe Val 
225 2 30 235 240 
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Leu Gly Glu Trp Gin Pro He Glu Tyr Gly Lys Lys Lys Leu Lys Tyr 
24 5 250 " 255 

Leu Pro Tyr Asn His Gin His Glu Tyr Phe Phe Leu He Gly Pro Pro 

2bu 265 27U 

Leu Leu He Pro Met Tyr Phe Gin Tyr Gin He lie Met Thr Met He 
275 280 285 

Va3 His Lys Asn Trp Val Asp Leu Ala Trp Ala Val Ser Tyr Tyr He 
290 295 300 

Arg Phe Phe He Thr Tyr He Pro Phe Tyr Gly He Leu Gly Ala Leu 
305 310 315 ^ 320 

Leu Phe Leu Asn Phe He Arg Phe Leu Glu Ser His Trp Phe Val Trp 

325 33u 335 

Val Thr Gin Met Asn His He Val Met Glu lie Asp Gin Glu Ala Tyr 
34 0 345 350 

Arg Asp Trp Phe Ser Ser Gin Leu Thr Ala Thr Cys Asn Val Glu Gin 
355 360 365 

25 Ser Phe Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 

370 37b 380 

His His Leu Phe Pro Thr Met Pro Arg His Asn Leu His Lys He Ala 
385 390 395 400 
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Pro Leu Val Lys Ser Leu Cys Ala Lys His Gly He Glu Tyr Gin Glu 
405 410 " 415 

Lys Pro Leu Leu Arg Ala Leu Leu Asp He He Arg Ser Leu Lys Lys 
42U 425 ^ 430 

Ser Gly Lys Leu Trp Leu Asp Ala Tyr Leu His Lys 
435 440 
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(2) INFORMATION FOR SEQ 3D NO: 6: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH r 44b amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/ KEY : Protein 

(B) LOCATION: i .. 445 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Gly Gly Val Gly Glu Pro Gly Pro Arg Glu Giy Pro Ala Gin Pro 
15 10 15 

Gly Ala Pro Leu Pro Thr Phe Cys Trp Glu Gin He Arg Ala His Asp 

20 25 30 

Gin Pro Gly Asp Lys Trp Leu Val Tie Glu Arg Arg Val Tyr Asp He 
35 4 0 4 5 

Ser Arg Trp Ala Gin Arg His Pro Gly Gly Ser Arg Leu He Gly His 
50 55 60 

His Gly Ala Glu Asp Ala Thr Asp Ala Phe Arg Ala Phe His Gin Asp 

65 70 75 80 

Leu Asn Phe Val Arg Lys Phe Leu Gin Pro Leu Leu He Gly Glu Leu 
85 90 95 

Ala Pro Glu Glu Pro 5er Gin Asp Gly Pro Leu Asn Ala Gin Leu Val 
100 105 110 

Glu Asp Phe Arg Ala Leu His Gin Ala Ala Glu Asp Met Lys Leu Phe 
115 120 125 

Asp Ala Ser Pro Thr Phe Phe Ala Phe Leu Leu Gly His He Leu Ala 
40 130 135 140 

Met Glu Val Leu Ala Trp Leu Leu He Tyr Leu Leu Gly Pro Gly Trp 
14b 150 155 160 



Val Pro Ser Ala Leu Ala Ala Phe He Leu Ala He Ser Gin Ala Gin 
165 170 175 

Ser Trp Cys Leu Gin His Asp Leu Gly His Ala Ser He Phe Lys Lys 
180 185 190 

so Ser Trp Trp Asn His Val Ala Gin Lys Phe Val Met Gly Gin Leu Lys 

195 200 205 

Gly Phe Ser Ala His Trp Trp Asn Phe Arg His Phe Gin His His Ala 
210 215 220 



55 



Lys Pro Asn He Phe His Lys Asp Pro Asp Val Thr Val Ala Pro Val 

225 230 235 240 
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Phe Leu Leu Gly Glu Ser Ser Val Glu Tyr Gly Lys Lys Lys Arg Arg 

245 250 " * ~ 255 

Tyr Leu Pro Tyr Asn Gin Gin His Leu Tyr Phe Phe Leu He Gly Pro 

260 265 270 



10 



Pro Leu Leu Thr Leu Val Asn Phe Glu Val Glu Asn Leu Ala Tyr Met 

275 2B0 285 

Leu Val Cys Met: Gin Trp Ala Asp Leu Leu Trp Ala Ala Ser Phe Tyr 

290 295 300 



75 



Ala Arg Phe Phe Leu Ser Tyr Leu Pro Phe Tyr Gly Val Pro Gly Val 
305 310 315 ' 320 

Leu Leu Phe Phe Val Ala Val Arg Val Leu Glu Ser His Trp Phe Val 
325 330 33b 



20 



Trp lie Thr Gin Met Asn His lie Pro Lys Glu He Gly His Glu Lys 
34U 345 350 



His Arg Asp Trp Val Ser Ser Gin Leu Ala Ala Thr Cys Asn Val Glu 
355 360 365 



25 



Pro Ser Leu Phe Thr Asn Trp Phe Ser Gly His Leu Asn Phe Gin He 
370 375 380 



30 



Glu His His Leu Phe Pro Arg Met Pro Arg His Asn Tyr Ser Arg Val 

38o 390 395 400 

Ala Pro Leu Val Lys Ser Leu Cys Ala Lys His Gly Leu Ser Tyr Glu 

40b 410 415 



35 



Val Lys Pro Phe Leu Thr Ala Leu Val Asp He Val Arg Ser Leu Lys 
420 425 430 

Lys Ser Gly Asp He Trp Leu Asp Ala Tyr Leu His Gin 
435 440 " 445 



40 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: exon 

( B ) LOCATION:!. .lb 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCGTCCCCGA CAGTCCGC 18 

(2) INFORMATION FOR SEQ ID NO: 8: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



30 (ix) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION : 1 . .20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TAAAACACTA ACCGGTGGAC 20 

(2) INFORMATION FOR SEQ ID NO: 9: 

(j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 
60 (A) NAME/ KEY : exon 

(B) LOCATION : I . . 22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: S>: 
55 ATCGTCTACC AAGGT GAAAC TC 22 



^NSDOCID: <EP 1035207A1 J_> 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
I A) LENGTH: 20 base pairs 
IB) TYPE: nucieic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION : 1 • .20 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCTTGTACCT GACCCAAAGG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION : 1 . .21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGATACTTCT TTTCCGGGTC C 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(Bj TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION:!. .19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCCGTCTCCC CTCGAACTC 
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15 



20 



35 



40 



45 



SO 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acici 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE; 

(A) NAME/KEY: exon 

(B) LOCATION: 1, .19 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGCGTCCAAT ACGAAGACT 19 

(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



30 (ix) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION : 1 . .20 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGACGACTAA CCACTTGACC 20 

(2) I N FORMAT I ON FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE : 

(A) NAME/ KEY: exon 

(B) LOCATION :1. .20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCAAGTGGTT AGTCGTCCCC 20 



NSDOCID: <EP 1035207A1J_> 
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(2) INFORMATION FOR SEQ ID NO: lb: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iX) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION : 1 . .20 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGACTTTATG GACGGGATGT 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



<ix) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION:!. .19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AACGGCATGA GCTACCCGA 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION : 1 . .20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IB: 
CTACCCGAAA ACAAGAC TCC 



1035207A1_I_> 
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40 



18 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 18 base pairs 
\h) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(Dj TOPOLOGY: iineax 

(ii) MOLECULE TYPE: DNA (genomic) 

(iX) FEATURE: 

(A) NAME/ KEY: exon 
<B) LOCATION: i. . 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: IS: 
CGGCTCCAGG CTCG1TCT 

\2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
tD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION:!. .20 

3S (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GTGACCAAGC ACACCTAGTG ?Q 



10 



IS 



20 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 21; 



li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
4S (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA {genomic) 



(ix) FEATURE : 
SO (A) NAME/ KEY: exon 

( B) LOCATION:!.. 19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
55 GTAGGACCGA CCCAAGAAG 19 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: exon 
1S (B) LOCATION: 1. .20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
20 CTCTTAACCC CCTTTCGACA 20 



25 



30 



35 



40 



45 



SO 



55 
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SEQUENCE LISTING 

5 <110> MultiGene Biotech GmbH 

<120> cDNA molecules of the members of gene family encoding 

human fatty acid desaturases and their use in diagnosis 
10 and therapy 

<130> MultiGene Anm. 99/002 EP 



IS 



20 



25 



30 



<140> 99104664.0-2105 
<141> 1999-03-09 

<160> 22 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 4205 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 
35 unknown 
<400> 1 

cactcctgga gcccgcggac cccgagcacg cgcctgacag cccctgctgg cccggcgcgc 60 
40 ggcgtcgcca ggccagctat ggcccccgac ccggtggccg ccgagaccgc ggctcagcga 120 

cctaccccgc gctacttcac ctgggacgag gtggcccagc gctcagggtg cgaggagcgg 180 
tgcctagtga tcgaccgtaa ggtgtacaac atcagcgagt tcacccgccg gcatccaggg 240 
ggctcccggg tcatcagcca ctacgccggg caggatgcca cggatccctt tgtggccttc 300 
45 cacatcaaca agggccttgt gaagaagtat atgaactctc tcctgattgg agaactgtct 360 

ccagagcagc ccagctttga gcccaccaag aataaagagc tgacagatga gttccgggag 420 
ctgcgggcca cagtggagcg gatggggctc atgaaggcca accatgtctt cttcctgctg 480 
tacctgctgc acatcttgct gctggatggt gcagcctggc tcaccctttg ggtctttggg 540 
acgtcctttt tgcccttcct cctctgtgcg gtgctgctca gtgcagttca ggcccaggct 600 
ggctggctgc agcatgactt tgggcacctg tcggtcttca gcacctcaaa gtggaaccat 660 
ctgctacatc attttgtgat tggccacctg aagggggccc ccgccagttg gtggaaccac 720 
atgcacttcc agcaccatgc caagcccaac tgcttccgca aagacccaga catcaacatg 780 
catcccttct tctttgcctt ggggaagatc ctctctgtgg agcttgggaa acagaagaaa 84 0 



SO 
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aaatatatgc cgtacaacca ccagcacaaa tacttcttcc taattgggcc cccagccttg 900 
ctgcctctct acttccagtg gtatattttc tattttgtta tccagcgaaa gaagtgggtg 960 
gacttggcct ggatgattac cttctacgtc cgcttcttcc tcacttatgt gccactattg 1020 
gggctgaaag ccttcctggg ccttttcttc atagtcaggt tcctggaaag caactggttt 1080 
gtgtgggtga cacagatgaa ccatattccc atgcacattg atcatgaccg gaacatggac 1140 
tgggtttcca cccagctcca ggccacatgc aatgtccaca agtctgcctt caatgactgg 1200 
ttcagtggac acctcaactt ccagattgag caccatcttt ttcccacgat gcctcgacac 1260 
aattaccaca aagtggctcc cctggtgcag tccttgtgtg ccaagcatgg catagagtac 1320 
cagtccaagc ccctgctgtc agccttcgcc gacatcatcc actcactaaa ggagtcaggg 1380 
cagctctggc tagatgccta tcttcaccaa taacaacagc caccctgccc agtctggaag 1440 
aagaggagga agactctgga gccaaggcag aggggagctt gagggacaat gccactatag 1500 
tttaatactc agagggggtt gggtttgggg acataaagcc tctgactcaa actcctccct 1560 
tttatcttct agccacagtt ctaagaccca aagtgggggg tggacacaga agtccctagg 1620 
agggaaggag ctgttggggc aggggtgtaa attatttcct ttttctagtt tggcacatgc 1680 
aggtagttgg tgaacagaga gaaccaggag ggtaacagaa gaggagggac ctactgaacc 1740 
cagagrcagg aagagattta acactaaaat tccactcatg ccgggcgtgg tgcacgcgcc 1800 
tgtaaiccca gctacccagg aggctgaggc aggagaatcg cttgaaccgg ggaggtggag 1860 
gttgcagtga gctgagatca cgccattgta ctccagcctg ggcgacagag caagactcca 1920 
tttcaaaaaa aaaaaaaaaa aaaaaaatcc actcatataa aaggtgagct cagctcactg 1980 
gtccatttct cagtggcttc tccatcctca tttgcaaacc tcagagggat aaggcagttg 2040 
aacctgatga gcaagaatta taacagcaag gaaacattaa tgcttagaat tctgagatcc 2100 
agcacaactc agtctgtggg agctcagctc gctgcccagg gataggtatg acctatgtct 2160 
gccttaggct gctgggagat gccattctcc agtttcagaa gcaggcaggg caaaggtcaa 2220 
gactgtggta ttggggtctt ttggctctga aggatcctgg aaccactgat tttggtttat 2280 
tccctccagg gtctaaagag aacaagaggt gctagctctt accaaaacag atggtagaga 2340 
gagttgctgg ctatttaaaa agctctttca tcttttaatt cacctcttct tttcacctct 2400 
ttaaccactc ctcaggaaca gaacacttct aggactgggg gtcttttagc tccataagca 2460 
agtgagcaga tgggacaagt tagtcttttc tccctagaaa caaaggggat gcccagtggt 2520 
ttccctttgc ttcccaacct aaaatttcaa gtttaauaa atagcaatta gcagaagtga 2580 
ccaaattggg agataattat cagtcatgag gaaagacaca gatttcggtc ataaagaatg 2640 
taagggctat aagtagaaac tttctataac ctaaatgatg ttatagaatt atttttgagc 2700 
aggagcagaa agattaaata tgatcacttc atacttctaa atcagaaata ggaagattaa 2760 
aaccacagaa cagtttgtga tttctattgc tggtagctag gtatcttact ctgtccactc 2820 
ttgttcaagt atctaactct tctggaaacc aaataggctt tagaagagat tatcctatat 2880 
tcctatcagt ataatactaa aatgtaactt tttaatcatc tggtttttaa aagataaaca 2940 
gtttagccca tctctccaga gagcaaacat aggaatatga ctcaggagcc tcctagggct 3000 
tatcatcagc cctcacaccc gcttccccct ccaacccaca gcctttgctt ccaggtggca 3060 
ggattactac tttgcctctt cagcagcatc tactctaggc atattgatca ttttagacac 3120 
tgggagaaga gaacctcaaa ctaggaggaa aagacagagc ctccacttag ttttgggagg 3180 
ggatggcaga cagtcaagga gatgagcgtc ctaaggcatg ttgggatagg gtcagatgca 3240 
ccacccatgg agaggtttgt caacacaaag acatggaagg ttagaggttt gtcaacaaaa 3300 
agacatggaa ggttaggttt gtcaacacaa agacatggaa gattagaggt ttgtcaacac 3360 
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aaagatacag gaagaatggg ctgcagaaga tttagatgtt ttccatttgg gcacatttta 3420 

cttagctgga gaactaggtt taaaacagcc rcggtaggaa aattagaagc aagctggatg 3480 

cagtggctca tgcctgtaat cccaacactt ttgggaggtc caggcaggag gatcacttgg 3540 

gcccaggagg tcaagcctgc agcgagctga gatcacacca ctgcactcca gcctggggtg 3600 

atagaacaag accctgtctc aaaaaaaaaa aaaaacaaca aaaacttaga attgaggagt 3660 

tgtacctcca ttggcttcct cactccaaaa taggtgctga tccttcctat tcctattctt 3720 

tgccaccttt tgggtgtggt gtcaccagcc tgtttagcca agtagctttg ggcataggct 3780 

gcccaatctg agcaaacacc agtgaggctc tattgagcaa gaccaagtcc tcaaagcacc 3840 

tgaaccactg tggccttctc agcctacagc . agtgtggtct cttacatggc cacaaaggga 3900 

cacacagtga caaaaggctc ggaatgttac aatggtaaaa tgagtgatct caaatccact 3960 

gacagatata aaataggctt agagaggaaa agctgcctct ggtcaagtag atcatggcag 4020 

catgaattcc aactcacttt tttacgaact ccaacttcta tgtttatctt tgttactttc 4080 

acttttttac aacctgncag aggcattttt taaatcaggc ccaatatcag tattcttttt 4140 

gtgtgtgcca attttgttat cacatcccta tgaagttgaa aaataaagtt aattttgacc 4200 

aaaag 4205 

<210> 2 
<211> 4089 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 

unknown 
<400> 2 

cgtcacagtc ggcaggcagc atggggaagg 
gcgaggtgtc ggtgcccacc ttcagctggg 
acaggtggct ggtcattgac cgcaaggttt 
cggggggcca gcgggtcatc gggcactacg 
ccttccaccc tgacctggaa ttcgtgggca 
tggccccgga ggagcccagc caggaccacg 
gggccctgag gaagacggct gaggacatga 
tcctcctcct ggcccacatc atcgccctgg 
ttggcaatgg ctggattcct accctcatca 
aagctggatg gctgcaacat gattatggcc 
accaccttgt ccacaaattc gtcattggcc 
atcatcgcca cttccagcac cacgccaagc 
acatgctgca cgtgtttgtt ctgggcgaat 
tgaaatacct gccctacaat caccagcacg 
tcatccccat gtatttccag taccagatca 



gagggaacca gggcgagggg gccgccgagc 60 
aggagattca gaagcataac ctgcgcaccg 120 
acaacatcac caaatggtcc atccagcacc 180 
ctggagaaga tgcaacggat gccttccgcg 240 
agttcttgaa acccctgctg attggtgaac 300 
gcaagaactc aaagatcact gaggacttcc 360 
acctgttcaa gaccaaccac gtgttcttcc 420 
agagcattgc atggttcact gtcttttact 480 
cggcctttgt ccttgctacc tctcaggccc 540 
acctgtctgt ctacagaaaa cccaagtgga 600 
acttaaaggg tgcctctgcc aactggtgga 660 
ctaacatctt ccacaaggat cccgatgtga 720 
ggcagcccat cgagtacggc aagaagaagc 780 
aatacttctt cctgattggg ccgccgctgc 840 
tcatgaccat gatcgtccat aagaactggg 900 
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tggacctggc ctgggccgtc agctactaca tccggttctt catcacctac atccctttct 960 

acggcatcct gggagccctc cttttcctca acctcatcag gttcctggag agccactggt 1020 

ttgtgtgggt cacacagatg aatcacatcg tcatggagat tgaccaggag gcctaccgtg 1080 

actggttcag tagccagctg acagccacct gcaacgtgga gcagtccttc ttcaacgact 1140 

ggttcagtgg acaccttaac ttccagattg agcaccacct cttccccacc atgccccggc 1200 

acaacttaca caagatcgcc ccgctggtga agtctctatg tgccaagcat ggcattgaat 1260 

accaggagaa gccgctactg agggccctgc tggacatcat caggtccctg aagaagtctg 1320 

ggaagctgtg gctggacgcc taccttcaca aatgaagcca cagcccccgg gacaccgtgg 1380 

ggaaggggtg caggtggggt gatggccaga ggaatgatgg gcttttgttc tgaggggtgt 1440 

ccgagaggct ggtgtatgca ctgctcacgg accccatgtt ggatctttct ccctttctcc 1500 

tctccttttt ctcttcacat ctcccccata gcaccctgcc ctcatgggac ctgccctccc 1560 

tcagccgtca gccatcagcc atggccctcc cagtgcctcc tagccccttc ttccaaggag 1620 

cagagaggtg cccaccgggg gtggctctgt cctacctcca ctctctgccc ctaaagatgg 1680 

gaggagacca ccggtccatg ggtctggcct gtgagtctcc ccttgcagcc tggtcactag 1740 

gcatcacccc cgctttggtt cttcagatgc tcttggggtt cataggggca ggtcctagtc 1800 

gggcagggcc cctgaccctc ccggcctggc ttcactctcc ctgacggctg ccattggtcc 1860 

accctttcat agagaggcct gctttgttac aaagctcggg tctccctcct gcagctcggt 1920 

taagtacccg aggcctctct taagatgtcc agggccccag gcccgcgggc acagccagcc 1980 

caaaccttgg gccctggaag agtcctccac cccatcacta gagtgctctg accctgggct 2040 

ttcacgggcc ccattccacc gcctccccaa cttgagcctg tgaccttggg accaaagggg 2100 

gagtccctcg tctcttgtga ctcagcagag gcagtggcca cgttcaggga ggggccggct 2160 

ggcctggagg ctcagcccac cctccagctt ttcctcaggg tgtcctgagg tccaagattc 2220 

tggagcaatc tcacccttct ccaaaggctc tgttatcagc tgggcagtgc cagccaatcc 2280 

ctggccattt gcccccaggg gacgtgggcc ctgcaggctg caggagggca ctggagctgg 2340 

gaggtctcgt cccagccctc cccatctcgg ggctgctgtg tggacggcgc tgcctcaggc 2400 

actctcctgt ctgaacctgc ccttactgtg tttaacctgt tgctccagga tgcattctga 2460 

taggaggggg cggcagggct gggccttgtg acaatctgcc tttcaccaca tggccttgcc 2520 

tcggtggccc tgactgtcag ggagggccag ggaggcagag ccggagggac tctcaggagg 2580 

aggctgccct gaggggctgg ggagggggta cctcatgagg accagggtgg agctgagaag 2640 

aggacgaggt gggggctgga ggtgctggta gctgagggga cgggcaagtg agaggggagg 2700 

gagggaagtc ctgggaggat cctgagctgc tgttgcagtc taacccacta atcagttctt 2760 

agattcaggg gaagggcagg caccaacaac tcagaatggg ggctttcggg gagggcgcct 2820 

agtcccccca gctctaagca gccaggaggg acctgcatct aagcatctgg gttgccatgg 2880 

caatggcatg ccccccagct actgtatgcc cccgaccccc gcagaggcag aatgaaccca 2940 

tagggagctg atcgtaatgt ttatcatgtt acttccccac ccctacattt tttgaaataa 3000 

aataaggaat tttattctca cttcctgtgt ttcctgcacg ccaatgccag gccatggtat 3060 

tgggtgatag atgaggccct tctagctggg cctgggcacc aggaggggtc cccatgcttg 3120 

catctctctg tatcccctcc ctcccctgtg gccatcccac ccgcctctcc ctgctgcctc 3180 

tgaaattcat tctggggccc ggaacttggt ggaaatgacc caaaaacatt ggcccatctt 3240 

cctcctctca gcagccgacc ccagcccaat tctaaaacag ggctgagagc cacctctcag 3300 

cagctgaccc ctacccaagg agggtggcat ggaggggctt gcagagactc ttcctaacat 3360 

cctccccccc cagctgtctc cccaagtgca atctgccctc ccatccctgg gccagccagc 3420 
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ttccacagag cgccaggcca aacagaattc ctggcctcct tcgaaggggc tggagaaggc 3480 
cgggagcagt ggctcacgcc tgtaatccca gcactttggg aggctgaggc gggcagatca 3540 
caaagtcaag agattgagac catcctggcc aacatggtga aaccccgtci ctactaaaaa 3600 
tacaaaaatt aggccgggtg cggtggctca cgcctgtaat cccagcactr tgggaggccg 3660 
aggcgggcag atcacgaggt caggagatca agaccatcct ggctaacacg gtgaaacccc 3720 
gtctctacta aaaatacaaa aaattagctg ggcgaggtgg cgggtgcctg tagtcccagc 3780 
tacgtgggag gctgaggcaa gagaatggcg tgaacccccg cggggcagag cctgcagaga 3840 
gctgagatca caccactgta ctccagcctg ggcgacagcg agactccgtc tcaaaaaaaa 3900 
aaaaaaaaaa aattagctgg gcatggtggt gcgtgcctcc agtcccagct actcaggagg 3S60 
ctgagacggg agaatcgctt gaacctggga ggcagaggtt gcagtgagcc aagatcgctc 4020 
actccagcct agcgacagag tgagactcca tctcaaataa ataaataaat taattaatta 4080 
aattaaatt 4089 



<210> 3 
<211> 1757 
<212> DNA 

<213> Unknown Organism 



<220> 

<223> Description of Unknown Organism: 



unknown 



<400> 3 

ggccgcggcg gcagggcggg gccggagcag 
gctcttcgct tccctcgcgg tcttgctcgg 
tcgtgcgtgc agcatgggcg gcgtcgggga 
gggggcaccg ctgcccacct tctgctggga 
caagtggctg gtcatcgagc gccgcgtcta 
agggggcagc cgcctcatcg gccaccacgg 
cttccatcaa gatctcaatt ttgtgcgcaa 
ggctccggaa gaacccagcc aggatggacc 
agccctgcac caggcagccg aggacatgaa 
tttcctactg ggccacatcc tggccatgga 
gggtcctggc tgggtgccca gtgccctggc 
gtcctggtgt ctgcagcatg acctgggcca 
ccacgtggcc cagaagttcg tgatggggca 
cttccgccac ttccagcacc acgccaagcc 
ggtggcgccc gtcttcctcc tgggggagtc 
atacctaccc tacaaccagc agcacctgta 
cctggtgaac tttgaagtgg aaaatctggc 
tttgctctgg gccgccagct tctatgcccg 



cgggcggcgg cggaggcggc gcccgggagc 60 
acctcggcca ccgcctggga tccccaggac 120 
gccgggaccg cgggagggac ccgcgcagcc 180 
gcagatccgc gcgcacgacc agcccggcga 240 
cgacatcagc cgctgggcac agcggcaccc 300 
cgctgaggac gccacggatg ccttccgtgc 360 
gttcctacag cccctgttga ttggagagct 420 
cctgaatgcg cagctggtcg aggacttccg 460 
gctgtttgat gccagtccca ccttctttgc 540 
ggtgctggcc tggctcctta tctacctcct 600 
cgccttcatc ctggccatct ctcaggctca 660 
tgcctccatc ttcaagaagt cctggtggaa 720 
gctaaagggc ttctccgccc actggtggaa 780 
caacatcttc cacaaagacc cagacgtgac 840 
atccgtcgag tatggcaaga agaaacgcag 900 
ctticttcctg atcggcccgc cgctgctcac 960 
gtacatgctg gtgtgcatgc agtgggcgga 1020 
cttcttctta tcctacctcc ccttctacgg 1080 
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cgtccctggg gtgctgctct tctttgttgc tgtcagggtc ctggaaagcc actggtt cgt 1140 

gtggatcaca cagatgaacc acatccccaa ggagatcggc cacgagaagc accggcactg 1200 

ggtcagctct cagctggcag ccacctgcaa cgtggagccc tcacttttca ccaactcgtt 1260 

cagcgggcac ctcaacttcc agatcgagca ccacctcttc cccaggatgc cgagacacaa 1320 

ctacagccgg gtggccccgc tggtcaagtc gctgtgtgcc aagcacggcc tcagct^ cga 1380 

agtgaagccc ttcctcaccg cgctggtgga catcgtcagg tccctgaaga agtctgctga 1440 

catctggctg gacgcctacc tccatcagtg aaggcaacac ccaggcgggc agagaagggc 1500 

tcagggcacc agcaaccaag ccagccccgg cgggatcgat acccccaccc ctccacvggc 1560 

cagcctgggg gtgccctgcc tgccctcctg ctactgttgt cttcccctcg gcccccicac 1620 

atgtgtattc agcagcccta tggccttggc tctgggcctg atgggacagg ggtagaqqga 1680 

aggtgagcat agcacatttt cctagagcga caattggggg aaagctgtta tttttatatt 1740 

aaaatacatt cagatgt 1757 
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<210> 4 

<211> 444 

<212> PRT 

25 <213> Unknown Organism 
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<220> 

<223> Description of Unknown Organism: 
unknown 



<400> 4 

35 Met Ala Pro Asp Pro Val Ala Ala Glu Thr Ala Ala Gin Gly Pro Thr 

15 10 15 

Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gin Arg Ser Gly Cys u 
40 20 25 30 

Glu Arg Trp Leu Val lie Asp Arg Lys Val Tyr Asn lie Ser Glu Phe 
35 40 45 



Thr Arg Arg His Pro Gly Gly Ser Arg Vai lie Ser His Tyr Ala Gly 
50 55 60 

Gin Asp Ala Thr Asp Pro Phe Val Ala Phe His He Asn Lys Gly Leu 

65 70 75 80 

Val Lys Lys Tyr Met Asn Ser Leu Leu He Gly Glu Leu Ser Pro Glu 
85 90 95 
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Gin Pro Ser Phe Glu Pro Thr Lys Asn Lys Glu Leu Thx Asp Glu Phe 
100 105 110 

Arg Glu Leu Arg Ala Thr Val Glu Arg Met Gly Leu Met Lys Ala Asn 
115 120 125 

His Val Phe Phe Leu Leu Tyr Leu Leu His He Leu Leu Leu Asp Gly 
130 135 140 

75 Ala Ala Trp Leu Thr Leu Trp Val Phe Gly Thr Ser Phe Leu Pro Phe 

145 150 155 160 

Leu Leu Cys Ala Val Leu Leu Ser Ala Val Gin Ala Gin Ala Gly Trp 
20 165 170 175 

Leu Gin His Asp Phe Gly His Leu Ser Val Phe Ser Thr Ser Lys Trp 
180 185 190 

25 

Asn His Leu Leu His His Phe Val He Gly His Leu Lys Gly Ala Pro 
195 200 205 

Ala Ser Trp Trp Asn His Met His Phe Gin His His Ala Lys Pro Asn 
210 215 220 

Cys Phe Arg Lys Asp Pro Asp He Asn Met His Pro Phe Phe Phe Ala 
225 230 235 240 

Leu Gly Lys He Leu Ser Val Glu Leu Gly Lys Gin Lys Lys Lys Tyr 
245 250 255 

Met Pro Tyr Asn His Gin His Lys Tyr Phe Phe Leu He Gly Pro Pro 

260 265 210 

Ala Leu Leu Pro Leu Tyr Phe Gin Trp Tyr He Phe Tyr Phe Val He 
275 280 285 

SO Gin Arg Lys Lys Trp Val Asp Leu Ala Trp Met He Thr Phe Tyr Val 

290 295 300 

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu 
SS 305 310 315 320 
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Gly Leu Phe Phe lie Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp 
325 330 335 

Val Thr Gin Met Asn His lie Pro Met His He Asp His Asp Arg Asn 
340 345 350 

Met Asp Trp Val Ser Thr Gin Leu Gin Ala Thr Cys Asn Val His Lys 
355 360 365 

Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 

370 375 380 

His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Lys Val Ala 
385 390 395 400 

Pro Leu Val Gin Ser Leu Cys Ala Lys His Gly He Glu Tyr Gin Ser 
405 410 415 

Lys Pro Leu Leu Ser Ala Phe Ala Asp He He His Ser Leu Lys Glu 
420 425 430 

Ser Gly Gin Leu Trp Leu Asp Ala Tyr Leu His Gin 
435 440 



35 

<210> 5 
<211> 444 
<212> PRT 
40 <213> Unknown Organism 



<220> 

<223> Description of Unknown Organism: 

unknown 
<400> 5 

Met Gly Lys Gly Gly Asn Gin Gly Glu Gly Ala Ala Glu Arg Glu Val 
15 10 15 

Ser Val Pro Thr Phe Ser Trp Glu Glu He Gin Lys His Asn Leu Arg 
20 25 30 
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Thr Asp Arg Trp Leu Val He Asp Arg Lys Val Tyr Asn He Thr Lys 
35 40 45 

Trp Ser He Gin Kis Pro Gly Gly Gin Arg Val He Gly His Tyr Ala 

50 55 60 

Gly Glu Asp Ala Thr Asp Ala Phe Arg Ala Phe His Pro Asp Leu Glu 
65 70 75 80 

Phe Val Gly Lys Phe Leu Lys Pro Leu Leu He Gly Glu Leu Ala Pro 
85 90 95 

Glu Glu Pro Ser Gin Asp His Gly Lys Asn Ser Lys He Thr Glu Asp 
100 105 110 

Phe Arg Ala Leu Arg Lys Thr Ala Glu Asp Met Asn Leu Phe Lys Thr 
115 120 125 

Asn His Val Phe Phe Leu Leu Leu Leu Ala His He lie Ala Leu Glu 
130 135 14G 

Ser lie Ala Trp Phe Thr Val Phe Tyr Phe Gly Asn Gly Trp He Pro 
145 150 155 160 

Thr Leu He Thr Ala Phe Val Leu Ala Thr Ser Gin Ala Gin Ala Gly 
165 170 175 

Trp Leu Gin His Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys 
180 165 190 

Trp Asn His Leu Val His Lys Phe Val He Gly His Leu Lys Gly Ala 
195 200 205 

Ser Ala Asn Trp Trp Asn His Arg His Phe Gin His His Ala Lys Pro 
210 215 220 

50 Asn He Phe His Lys Asp Pro Asp Val Asn Met Leu His Val Phe Val 

225 230 235 240 

Leu Gly Glu Trp Gin Pro He Glu Tyr Gly Lys Lys Lys Leu Lys Tyr 
55 245 250 255 
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5 



Leu Pro Tyr Asn His Gin His Glu Tyr Phe Phe Leu lie Gly Pro Pre 
260 265 270 

Leu Leu lie Pro Met Tyr Phe Gin Tyr Gin He He Met Thr Met He 

275 280 285 

Val His Lys Asn Trp Val Asp Leu Ala Trp Ala Val Ser Tyr Tyr He 
290 295 300 

Arg Phe Phe He Thr Tyr He Pro Phe Tyr Gly He Leu Gly Ala Leu 
305 310 315 320 

Leu Phe Leu Asn Phe He Arg Phe Leu Glu Ser His Trp Phe Val Trp 
325 330 335 

Val Thr Gin Met Asn His He Val Met Glu He Asp Gin Glu Ala Tyr 
340 345 350 

Arg Asp Trp Phe Ser Ser Gin Leu Thr Ala Thr Cys Asn Val Glu Gin 

355 360 365 

35 Ser Phe Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 

370 375 380 



10 



is 



20 



25 



30 



40 



SO 



55 



His His Leu Phe Pro Thr Met Pro Arg His Asn Leu His Lys He Ala 
385 390 395 400 



Pro Leu Val Lys Ser Leu Cys Ala Lys His Gly He Glu Tyr Gin Glu 

45 405 410 415 

Lys Pro Leu Leu Arg Ala Leu Leu Asp He He Arg Ser Leu Lys Lys 
420 425 430 



Ser Gly Lys Leu Trp Leu Asp Ala Tyr Leu His Lys 
435 440 
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<210> 6 
<211> 445 
<212> PRT 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 

unknown 
<400> 6 

Met Gly Gly Val Gly Glu Pro Gly Pro Arg Glu Gly Pro Ala Gin Pro 
15 10 15 

Gly Ala Pro Leu Pre Thr Fhe Cys Trp Glu Gin lie Arg Ala His Asp 
20 25 30 

Gin Pro Gly Asp Lys Trp Leu Val He Glu Arg Arg Val Tyr Asp He 
35 40 45 

Ser Arg Trp Ala Gin Arg His Pro Gly Gly Ser Arg Leu He Gly His 
50 55 60 

His Gly Ala Glu Asp Ala Thr Asp Ala Phe Arg Ala Phe His Gin Asp 
65 70 75 80 

Leu Asn Phe Val Arg Lys Phe Leu Gin Pro Leu Leu He Gly Glu Leu 
65 90 95 

Ala Pro Glu Glu Pro Ser Gin Asp Gly Pro Leu Asn Ala Gin Leu Val 
100 105 110 

Glu Asp Phe Arg Ala Leu His Gin Ala Ala Glu Asp Met Lys Leu Phe 
115 120 125 

Asp Ala Ser Pro Thr Phe Phe Ala Phe Leu Leu Gly His He Leu Ala 
130 135 140 

Met Glu Val Leu Ala Trp Leu Leu He Tyr Leu Leu Gly Pro Gly Trp 
145 150 155 160 
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Val Pro Ser Ala Leu Ala Ala Phe He Leu Ala He Ser Gin Ala Gin 
165 170 175 

5 

Ser Trp Cys Leu Gin His Asp Leu Gly His Ala Ser He Phe Lys Lys 
180 185 190 

10 Ser Trp Trp Asn His Val Ala Gin Lys Phe Val Met Gly Gin Leu Lys 

195 200 205 
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Gly Phe Ser Ala His Trp Trp Asn Phe Arg His Phe Gin His His Ala 
210 215 220 

Lys Pro Asn He Phe His Lys Asp Pro Asp Val Thr Val Ala Pro Val 

225 230 235 240 

Phe Leu Leu Gly Glu Ser Ser Val Glu Tyr Gly Lys Lys Lys Arg Arg 
245 250 255 

Tyr Leu Pro Tyr Asn Gin Gin His Leu Tyr Phe Phe Leu He Gly Pro 
260 265 270 

Pro Leu Leu Thr Leu Val Asn Phe Glu Val Glu Asn Leu Ala Tyr Met 
275 280 285 

Leu Val Cys Met Gin Trp Ala Asp Leu Leu Trp Ala Ala Ser Phe Tyr 
290 295 300 

Ala Arg Phe Phe Leu Ser Tyr Leu Pro Phe Tyr Gly Val Pro Gly Val 
305 310 315 320 

Leu Leu Phe Phe Val Ala Val Arg Val Leu Glu Ser His Trp Phe Val 
325 330 335 

Trp He Thr Gin Met Asn His He Pro Lys Glu He Gly His Glu Lys 
340 345 350 

His Arg Asp Trp Val Ser Ser Gin Leu Ala Ala Thr Cys Asn Val Glu 
355 360 365 

Pro Ser Leu Phe Thr Asn Trp Phe Ser Gly His Leu Asn Phe Gin He 
370 375 380 
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Glu His His Leu Phe Pro Arg Met Pro Arg His Asn Tyr Ser Arg Val 

365 390 395 400 

5 

Ala Pro Leu Val Lys Ser Leu Cys Ala Lys His Gly Leu Ser Tyr Glu 
405 410 415 

70* Val Lys Pro Phe Leu Thr Ala Leu Val Asp lie Val Arg Ser Leu Lys 

420 425 430 

Lys Ser Gly Asp lie Trp Leu Asp Ala Tyr Leu His Gin 
15 435 440 445 
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<210> 1 
<211> 18 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 

unknown 
<400> 7 

cgcctgacag cccctgct 18 



<210> 8 

<211> 20 

<212> DNA 

40 <213> Unknown Organism 

<220> 

<223> Description of Unknown Organism: 



unknown 
<400> 8 

csggtggcca atcacaaaat 20 

<210> 9 
<211> 22 
<212> DNA 
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<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 

unknown 
<40C> 9 

ctcaaagtgg aaccatctgc ta 22 

<210> 10 
<211> 2D 
<212> DNA 

<213> Unknown Organism 

<220> 

<223> Description of Unknown Organism: 
unknown 



<400> 10 

30 ggaaacccag tccatgttcc 20 

<210> 11 

<211> 21 

35 <212> DNA 

<213> Unknown Organism 

<220> 

40 <223> Description of Unknown Organism: 

unknown 



<400> 11 

cctgggcctt ttcttcatag t 21 

<210> 12 
<211> 15 
<212> DNA 

<213> Unknown Organism 



<220> 
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<223> Description of Unknown Organism: 



unknown 



70 



15 



<400> 12 

ctcaagctcc cctctgcct 

<210> 13 
<211> 19 
<212> DNA 

<213> Unknown Organism 



19 



20 



<220> 

<223> Description of Unknown Organism: 
unknown 
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<400> 13 

tcagaagcat aacctgcgc 



19 



30 



<210> 14 
<211> 20 
<212> DNA 

<213> Unknown Organism 



35 



<220> 

<223> Description of Unknown Organism: 



40 



unknown 
<400> 14 

ccagttcacc aatcagcagc 
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<210> 15 
<211> 20 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 
unknown 
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< 4 r 0> 15 

ccr crgctga ttggtcaact 



<2:0> 16 
<2:i> 20 
10 <2Z2> DNA 
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<222> Unknown Oroanism 

<2 2C> 

<2~3> Description of Unknown Organism: 
unknown 

<4 00> 16 

tciagggcag gtatttcagc 

<2 j 0> 17 
<22 1> 19 
<212> DNA 

<2 2 3> Unknown Oicanism 

<22 0> 

<22 3> Description of Unknown Organism: 



35 unknown 



55 



<4 00> 17 

aacccatcga gtacaccaa 

<2i0> 18 
<211> 20 
<2':2> DNA 

<2:3> Unknown Oicanism 
<220> 

<223> Description of Unknown Organism: 

unknown 
<400> 18 

ccicagaaca aaaccccatc 
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<210> 19 

5 <211> 18 

<212> DNA 

<213> Unknown Organism 

10 <22Q> 

<223> Description of Unknown Organism: 
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unknown 
<400> 19 

tcttgctcgg acctcggc 

<210> 20 
<211> 20 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 
unknown 



<400> 20 

35 gtgatccaca cgaaccagtg 



<210> 21 
40 <211> 19 



<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 

unknown 
<400> 21 

gaagaaccca gccaggatg 
<210> 22 
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10 



15 



20 



25 



<211> 20 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: 

unknown 
<400> 22 

acagctttcc cccaattctc 20 



Claims 

1 . An isolated cDNA molecule selected trom the group consisting ol 



(a) a polynucleotide having at least a 65 % homology, preferably at least a 80 % homology with a polynucleotide 
encoding a polypeptide selected from the group consisting o1 me polypeptides of SEQ ID NO: 4, SEQ ID NO: 
5 and SEQ ID NO: 6; 

(b) a polynucleotide having at least a 65 % homology, preferably at least a 80 % homology with a polynucleotide 
30 which by virtue of the redundancy of the genetic code, encodes the same polypeptide selected trom the group 

consisting of the polypeptides of SEQ ID NO: 4, SEQ ID NO: 5 and SEQ ID NO: 6; 

(c) a DNA molecule capable of hybridization under stringent conditions to a DNA molecule according to (a) or 
(b); 

(d) a polynucleotide which is complementary to the polynucleotide of (a), (b) or (c); and 

35 (e) a oligonucleotide comprising at least 15 consecutive nucleotides of the polynucleotide of (a) : (b), (c) or (d). 

2. An isolated cDNA molecule selected from the group consisting ol 

(a) a polynucleotide having at least a 65 % homology, preferably at least a 80 % homology with a polynucleotide 
to sequence selected from the group consisting of the polynucleotides of SEQ ID NO: 1 , SEQ ID NO: 2 and SEQ 

ID NO: 3; 

(b) a DNA molecule capable of hybridization under stringent conditions to a DNA molecule according to (a); 

(c) a polynucleotide which is complementary to the polynucleotide of (a) or (b); 

(d) a oligonucleotide comprising at least 15 consecutive nucleotides of the polynucleotide of (a) : (b) or (c); and 
4 5 (e) a DNA which is synonymous to the DNAs of (a), (b), (c) oi (d) due to the degeneracy of the genetic code. 

3. A DNA comprising a nucleotide sequence with at least a 65 % homology with the nucleotide sequences as defined 
in claim 1 or 2. 

50 4. A recombinant vector comprising the DNA as claimed in any of claims 1 to 3. 

5. A transgenic host cell comprising the DNA as claimed in any of ciaims 1 to 3. 

6. A transgenetic host cell transformed by the DNA according to any of claims 1 to 3 or the vector according to claim 
55 4 f a corresponding transgenetic organism or a conesponding transgenetic knock-in or knock-out animal model. 

7. A polypeptide comprising at least 65 % of a polypeptide sequence selected from the group consisting of the 
polypeptides of SEQ ID NO: 1 , SEQ ID NO: 2 and SEQ ID NO: S or its salt. 
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8. A polypeptide comprising a polypeptide sequence with at least a 85 % homology with the polypeptide sequence 
as claimed in claim 7, or its salt. 

9. A peptide comprising at least 15 consecutive amino acids of the polypeptide as claimed in claim 7, or its salt. 

s 

10. A polypeptide having substantially the same amino acid sequence as the polypeptide as claimed in claim 1 , or 
having a variant of the amino acid sequence ot the polypeptide as claimed in claim 7 with a deletion, addition or 
substitution o1 1 to 10 amino acids, or its salt. 

io 11. A process loi producing a polypeptide comprising expressing from the host cell ot claim 5 or 6 a polypeptide 
encoded by the DNA as claimed in any of claims 1 to 3. 

12. An antibody against the polypeptide of any of claims 7 to 10. 

is 13. A oligonucleotide primer having a nucleotide sequence selected from the group consisting ot the nucleotide se- 
quences ot SEQ ID NO: 7 to SEQ ID NO: 22. 

14. A method ot screening for modulators in known assays using constructs or ot screening for interacting proteins or 
factors using state of the art technologies. 



20 



25 



30 



35 



AO 



45 



50 



65 



15. A method of screening chemical libraries comprising transformed cell lines. 

16. A compound which alters or reacts with at least one epitope of the proteins and which is obtained by screening 
methods as claimed in claim 14 or 15. 

17. The use of the antibodies according to claim 12 for diagnostic or therapeutic purposes. 

18. A pharmaceutical composition comprising as an effective component an effective amount of the peptide as claimed 
in any of claims 7 to 10, or its salt, and a pharmaceutical^ acceptable carrier or diluent. 
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Fig.1 
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I | transmembrane region 
( | conserved histidine box 
▼ invarianl amino acid residue 
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Fig.2 



A9 Mouse 



A9 Rat 



L- A9 Hamster 



A9 Human 



A9 Pig 



A9 Yeast 
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A5 Mortierella 
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Fig. 3 

FADS1 cDNA 

CACTCCTGGAGCCCGCGGACCCCGAGCACGCGCCTGACAGCCCCTGCTGGCCCGGCGCGCGGCG 
TCGCCAGGCCAGCTATGGCCCCCGACCCGGTGGCCGCCGAGACCGCGGCTCAGGGACCTACCCC 
GCGCTACTTCACCTGGGACGAGGTGGCCCAGCGCTCAGGGTGCGAGGAGCGGTGGCTAGTGATC 
GACCGTAAGGTGTACAACATCAGCGAGTTCACCCGCCGGCATCCAGGGGGCTCCCGGGTCATCA 
GCCACTACGCCGGGCAGGATGCCACGGATCCCTTTG1 GGCCTTCCACATCAACAAGGGCCTTGT 
GAAGAAGTATATGAACTCTCTCCTGATTGGAGAACTGTCTCCAGAGCAGCCCAGCTTTGAGCCC 
ACCAAGAATAAAGAGCTGACAGATGAGTTCCGGGAGCTGCGGGCCACAGTGGAGCGGATGGGGC 
TCATGAAGGCCAACCATGTCTTCTTCCTGCTGTACCTGCTGCACATCTTGCTGCTGGATGGTGC 
AGCCTGGCTCACCCTTTGGGTCTTTGGGACGTCCTTTTTGCCCTTCCTCCTCTGTGCGGTGCTG 
CTCAGTGCAGTTCAGGCCCAGGCTGGCTGGCTGCAGCATGACTTTGGGCACCTGTCGGTCTTCA 
GCACCTCAAAGTGGAACCATCTGCTACATCATTTTGTGATTGGCCACCTGAAGGGGGCCCCCGC 
C AG T T GG T G GAACC AC AT G C AC T T C C A G C ACCAT GC C AAG C C C AAC T GC T T C CGC AAAG AC C C A 

GACATCAACATGCATCCCTTCTTCTTTGCCTTGGGGAAGATCCTCTCTGTGGAGCTTGGGAAAC 
AGAAGAAAAAATATATGCCGTACAACCACCAGCACAAATACTTCTTCCTAATTGGGCCCCCAGC 
CTTGCTGCCTCTCTACTTCCAGTGGTATATTTTCTATTTTGTTATCCAGCGAAAGAAGTGGGTG 
GACTTGGCCTGGATGATTACCTTCTACGTCCGCTTCTTCCTCACTTATGTGCCACTATTGGGGC 
TGAAAGCCTTCCTGGGCCTTTTCTTCATAGTCAGGTTCCTGGAAAGCAACTGGTTTGTGTGGGT 
GACACAGATGAACCATATTCCCATGCACATTGATCATGACCGGAACATGGACTGGGTTTCCACC 
CAGCTCCAGGCCACATGCAATGTCCACAAGTCTGCCTTCAATGACTGGTTCAGTGGACACCTCA 
ACTTCCAGATTGAGCACCATCTTTTTCCCACGATGCCTCGACACAATTACCACA7UVGTGGCTCC 
CCTGGTGCAGTCCTTGTGTGCCAAGCATGGCATAGAGTACCAGTCCAAGCCCCTGCTGTCAGCC 
TTCGCCGACATCATCCACTCACTAAAGGAGTCAGGGCAGCTCTGGCTAGATGCCTATCTTCACC 
AATj^AC AACAGCC AC C C T G C C C AG T C T G G AAGAAGAG G AGG AAGAC T C T GGAGCC AAG G C A GAG 
GGGAGCTTGAGGGACAATGCCACTATAGTTTAATACTCAGAGGGGGTTGGGTTTGGGGACATAA 
AGCCTCTGACTCAAACTCCTCCCTTTTATCTTCTAGCCACAGTTCTAAGACCCAAAGTGGGGGG 
TGGACACAGAAGTCCCTAGGAGGGAAGGAGCTGTTGGGGCAGGGGTGTAAATTATTTCCTTTTT 
CTAGTTTGGCACATGCAGGTAGTTGGTGAACAGAGAGAACCAGGAGGGTAACAGAAGAGGAGGG 
ACCTACTGAACCCAGAGTCAGGAAGAGATTTAACACTAAAATTCCACTCATGCCGGGCGTGGTG 
CACGCGCCTGTAATCCCAGCTACCCAGGAGGCTGAGGCAGGAGAATCGCTTGAACCGGGGAGGT 
GGAGGTTGCAGTGAGCTGAGATCACGCCATTGTACTCCAGCCTGGGCGACAGAGCAAGACTCCA 
TTTCAAAAAAAAAAAAAAAAAAAAAAATCCACTCATATAAAAGGTGAGCTCAGCTCACTGGTCC 
ATTTCTCAGTGGCTTCTCCATCCTCATTTGCAAACCTCAGAGGGATAAGGCAGTTGAACCTGAT 
GAGC AAGAAT TATAACAGC AAGGAAAC A T T AATGC T TAGAAT TC TGAGATCCAGC AC AAC T CAG 
TCTGTGGGAGCTCAGCTCGCTGCCCAGGGATAGGTATGACCTATGTCTGCCTTAGGCTGCTGGG 
AGATGCCATTCTCCAGTTTCAGAAGCAGGCAGGGCAAAGGTCAAGACTGTGGTATTGGGGTCTT 
TTGGCTCTGAAGGATCCTGGAACCACTGATTTTGGTTTATTCCCTCCAGGGTCTAAAGAGAACA 
AGAGGTGCTAGCTCTTACCAAAACAGAI'GGTAGAGAGAGTTGCTGGCTATTTAAAAAGCTCTTT 
CATCTTTTAATTCACCTCTTCTTTTCACCTCTTTAACCACTCCTCAGGAACAGAACACTTCTAG 
GACTGGGGGTCTTTTAGCTCCATAAGCAAGTGAGCAGATGGGACAAGTTAGTCTTTTCTCCCTA 
GAAACAAAGGGGATGCCCAGTGGTTTCCCTTTGCTTCCCAACCTAAAATTTCAAGTTTAATAAA 
ATAGCAATTAGCAGAAGTGACCAAATTGGGAGATAATTATCAGTCATGAGGAAAGACACAGATT 
TCGGTCATAAAGAA r rGTAAGGGCTATAAGTAGAAAClTTCTATAACCTAAATGATGTTATAGAA 
TTATTTTTGAGCAGGAGCAGAAAGATTA.AATATGATCACTTCATACTTCTAAATCAGAAATAGG 
AAGATTAAAACCACAGAACAGTTTGTGATTTCTATTGCTGGTAGCTAGGTATCTTACTCTGTCC 
AC TC T TG T TCAAGT AT C T AAC TC T TC TG G AAACCAAAT AGGC T T T AGAAGAGATT AT CC TATAT 
TC C TAT CAG T AT AAT AC 1' AAAAT G T AAC T T T T T AATC AT C T G G T T T T T AAAAGAT AAAC AG T T T 



BNSDOCID: <EP 1035207A1_I_> 



55 



EP 1 035 207 A1 



Fig. 3 cont. 

AGCCCATCTCTCCAGAGAGCAAACATAGGAATATGACTCA^GAGCCTCCTAGGGCTTATCATCA 
GCCCTCACACCCGCTTCCCCCTCCAACCCACAGCCTTTGCTTCCAGGTGGCAGGATTACTACTT 
TGCCTCTTCAGCAGCATCTACTCTAGGCATATTGATCATTTTAGACACTGGGAGAAGAGAACCT 
CAAACTAGGAGGAAAAGACAGAGCCTCCACTTAGTTTTGGGAGGGGATGGCAGACAGTCAAGGA 
GATGAGCGTCCTAAGGCATGTTGGGATAGGGTCAGATGCACCACCCATGGAGAGGTTTGTCAAC 
ACAAAGACATGGAAGGTTAGAGGTTTGTCAACAAAAAGAO.TGGAAGGTTAGGTTTGTCAACAC 
AAAGACATGGAAGATTAGAGGTTTGTCAACACAAAGATACAGGAAGAATGGGCTGCAGAAGATT 
TAGATGTTTTCCATTTGGGCACATTTTACTTAGCTGGAGA^.CTAGGTTTAAAACAGCCTGGGTA 
GGAAAATTAGAAGCAAGCTGGATGCAGTGGCTCATGCCTG7AATCCCAACACTTTTGGGAGGTC 
CAGGCAGGAGGATCACTTGGGCCCAGGAGGTCAAGCCTGCAGCGAGCTGAGATCACACCACTGC 
ACTCCAGCCTGGGGTGATAGAACAAGACCCTGTCTCAAAA^JVAAAAAAAAACAACAAAAACTTA 
GAATTGAGGAGTTGTACCTCCATTGGCTTCCTCACTCCAAAATAGGTGCTGATCCTTCCTATTC 
CTATTCTTTGCCACCTTTTGGGTGTGGTGTCACCAGCCTGTTTAGCCAAGTAGCTTTGGGCATA 
GGCTGCCCAATCTGAGCAAACACCAGTGAGGCTCTATTGAGCAAGACCAAGTCCTCAAAGCACC 
TGAACCACTGTGGCCTTCTCAGCCTACAGCAGTGTGGTCTCTTACATGGCCACAAAGGGACACA 
CAGTGACAAAAGGCTCGGAATGTTACAATGGTAAAATGAGTGATCTCAAATCCACTGACAGATA 
TAAAATAGGCTTAGAGAGGAAAAGCTGCCTCTGGTCAAGTAGATCATGGCAGCATGAATTCCAA 
CTCACTTTTTTACGAACTCCAACTTCTATGTTTATCTTTG'J TACTTTCACTTTTTTACAACCTG 
NCAGAGGGATTTTTTAAATCAGGCCCAATATCAGTATTCTTTTTGTGTGTGCCAATTTTGTTAT 
CACATCCCTATGAAGTTGAAAAATAAAGTTAATTTTGACCAAAAG 
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Fig. 4 

FADS 2 cDNA 

CGTCACAGTCGGCAGGCAGC ATG GGGAAGGGAGGGAACCAGGGCGAGGGGGCCGCCGAGCGCGA 
GGTGTCGGTGCCCACCTTCAGCTGGGAGGAGATTCAGAAGCATAACCTGCGCACCGACAGGTGG 
CTGGTCATTGACCGCAAGGTTTACAACATCACCAAATGGTCCATCCAGCACCCGGGGGGCCAGC 
GGGTCATCGGGCACTACGCTGGAGAAGATGCAACGGATGCCTTCCGCGCCTTCCACCCTGACCT 
GGAATTCGTGGGCAAGTTCTTGAAACCCCTGCTGATTGGTGAACTGGCCCCGGAGGAGCCCAGC 
CAGGACCACGGCAAGAACTCAAAGATCACTGAGGACTTCCGGGCCCTGAGGAAGACGGCTGAGG 
ACATGAACCTGTTCAAGACCAACCACGTGTTCTTCCTCCTCCTCCTGGCCCACATCATCGCCCT 
GGAGAGCATTGCATGGTTCACTGTCTTTTACTTTGGCAATGGCTGGATTCCTACCCTCATCACG 
GCCTTTGTCCTTGCTACCTCTCAGGCCCAAGCTGGATGGCTGCAACATGATTATGGCCACCTGT 
CTGTCTACAGAAAACCCAAGTGGAACCACCTTGTCCACAAATTCGTCATTGGCCACTT/LAAGGG 
TGCCTCTGCCAACTGGTGGAATCATCGCCACTTCCAGCACCACGCCAAGCCTAACATCTTCCAC 
AAGGATCCCGATGTGAACATGCTGCACGTGTTTGTTCTGGGCGAATGGCAGCCCATCGAGTACG 
GCAAGAAGAAGCTGAAATACCTGCCCTACAATCACCAGCACGAATACTTCTTCCTGATTGGGCC 
GCCGCTGCTCATCCCCATGTATTTCCAGTACCAGATCATCATGACCATGATCGTCCATAAGAAC 
TGGGTGGACCTGGCCTGGGCCGTCAGCTACTACATCCGGTTCTTCATCACCTACATCCCTTTCT 
ACGGCATCCTGGGAGCCCTCCTTTTCCTCAACTTCATCAGGTTCCTGGAGAGCCACTGGTTTGT 
GTGGGTCACACAGATGAATCACATCGTCATGGAGATTGACCAGGAGGCCTACCGTGACTGGTTC 
AGTAGCCAGCTGACAGCCACCTGCAACGTGGAGCAGTCCTTCTTCAACGACTGGTTCAGTGGAC 
ACCTTAACTTCCAGATTGAGCACCACCTCTTCCCCACCATGCCCCGGCACAACTTACACAAGAT 
CGCCCCGCTGGTGAAGTCTCTATGTGCCAAGCATGGCATTGAATACCAGGAGAAGCCGCTACTG 
AGGGCCCTGCTGGACATCATCAGGTCCCTGAAGAAGTCTGGGAAGCTGTGGCTGGACGCCTACC 
TTCACAAATg^AGCCACAGCCCCCGGGACACCGTGGGGAAGGGGTGCAGGTGGGGTGATGGCCA 
GAGGAATGATGGGCTTTTGTTCTGAGGGGTGTCCGAGAGGCTGGTGTATGCACTGCTCACGGAC 
CCCATGTTGGATCTTTCTCCCTTTCTCCTCTCCTTTTTCTCTTCACATCTCCCCCATAGCACCC 
TGCCCTCATGGGACCTGCCCTCCCTCAGCCGTCAGCCATCAGCCATGGCCCTCCCAGTGCCTCC 
TAGCCCCTTCTTCCAAGGAGCAGAGAGGTGGCCACCGGGGGTGGCTCTGTCCTACCTCCACTCT 
CTGCCCCTAAAGATGGGAGGAGACCAGCGGTCCATGGGTCTGGCCTGTGAGTCTCCCCTTGCAG 
CCTGGTCACTAGGCATCACCCCCGCTTTGGTTCTTCAGATGCTCTTGGGGTTCATAGGGGCAGG 
TCCTAGTCGGGCAGGGCCCCTGACCCTCCCGGCCTGGCTTCACTCTCCCTGACGGCTGCCATTG 
GTCCACCCTTTCATAGAGAGGCCTGCTTTGTTACAAAGCTCGGGTCTCCCTCCTGCAGCTCGGT 
TAAGTACCCGAGGCCTCTCTTAAGATGTCCAGGGCCCCAGGCCCGCGGGCACAGCCAGCCCAAA 
CCTTGGGCCCTGGAAGAGTCCTCCACCCCATCACTAGAGTGCTCTGACCCTGGGCTTTCACGGG 
CCCCATTCCACCGCCTCCCCAACTTGAGCCTGTGACCTTGGGACCAAAGGGGGAGTCCCTCGTC 
TCTTGTGACTCAGCAGAGGCAGTGGCCACGTTCAGGGAGGGGCCGGCTGGCCTGGAGGCTCAGC 
CCACCCTCCAGCTTTTCCTCAGGGTGTCCTGAGGTCCAAGATTCTGGAGCAATCTGACCCTTCT 
CCAAAGGCTCTGTTATCAGCTGGGCAGTGCCAGCCAATCCCTGGCCATTTGGCCCCAGGGGACG 
TGGGCCCTGCAGGCTGCAGGAGGGCACTGGAGCTGGGAGGTCTCGTCCCAGCCCTCCCCATCTC 
GGGGCTGCTGTGTGGACGGCGCTGCCTCAGGCACTCTCCTGTCTGAACCTGCCCTTACTGTGTT 
TAACCTGTTGCTCCAGGATGCATTCTGATAGGAGGGGGCGGCAGGGCTGGGCCTTGTGACAATC 
TGCCTTTCACCACATGGCCTTGCCTCGGTGGCCCTGACTGTCAGGGAGGGCCAGGGAGGCAGAG 
CGGGAGGGAGTCTCAGGAGGAGGCTGCCCTGAGGGGCTGGGGAGGGGGTACCTCATGAGGACCA 
GGGTGGAGCTGAGAAGAGGAGGAGGTGGGGGCTGGAGGTGCTGGTAGCTGAGGGGACGGGCAAG 
TGAGAGGGGAGGGAGGGAAGTCCTGGGAGGATCCTGAGCTGCTGTTGCAGTCTAACCCACTAAT 
CAGTTCTTAGATTCAGGGGAAGGGCAGGCACCAACAACTCAGAATGGGGGCTTTCGGGGAGGGC 
GCCTAGTCCCCCCAGCTCT7\AGCAGCCAGGAGGGACCTGCATCTAAGCATCTGGGTTGCCATGG 
CAATGGCATGCCCCCCAGCTACTGTATGCCCCCGACCCCCGCAGAGGCAGAATGAACCCATAGG 
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Fig. 4 cont. 

GAGCTGATCGTAATG T TTATCATGT T ACTTC CCCAC CCC TACAT T TTT TGAAMAAAATAAGGA 
ATTTTATTCTCACTTCCTGTGTTTCCTGCACGCCAATGCCAGGCCATGGTATTGGGTGATAGAT 
GAGGCCCTTCTAGCTGGGCCTGGGCACCAGGAGGGGTCCCCATGCTTGCATCTCTCTGTATCCC 
CTCCCTCCCCTGTGGCCATCCCACCCGCCTCTCCCTGCTGCCTCTGAAATTCATTCTGGGGCCC 
GGAACTTGGTGGAAATGACCCAAAAACATTGGCCCATCTTCCTCCTCTCAGCAGCCGACCCCAG 
CCCAATTCTAAAACAGGGCTGAGAGCCACCTCTCAGCAGCTGACCCCTACCCAAGGAGGGTGGC 
ATGGAGGGGCTTGCAGAGACTCTTCCTAACATCCTCCCCCCCCAGCTGTCTCCCCAAGTGCAAT 
CTGCCCTCCCATCCCTGGGCCAGCCAGCTTCCACAGAGCGCCAGGCCAAACAGAATTCCTGGCC 
TCCTTGGAAGGGGCTGGAGAAGGCCGGGAGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGG 
AGGCTGAGGCGGGCAGATCACAAAGTCAAGAGATTGAGACCATCCTGGCCAACATGGTGAAACC 
CCGTCTCTACTAAAAATACAAAAATTAGGCCGGGTGCGGTGGCTCACGCCTGTAATCCCAGCAC 
TTTGGGAGGCCGAGGCGGGCAGATCACGAGGTCAGGAGATCAAGACCATCCTGGCTAACACGGT 
GAAACC CC G TC TC TAC TAAAAATACAAAAAA T TAGC T G GGCGAGGTGG C GGG TGC C TGTAGT CC 
CAGCTACGTGGGAGGCTGAGGCAAGAGAATGGCGTGAACCCCGGCGGGGCAGAGCCTGCAGAGA 
GC T GAGA T CAC ACCAC TG TAC T C C AG C C TGG G C GACA G C G AGAC TCC G T C T C AAAAAAAAAAAA 
AAAAAAAATTAGCTGGGCATGGTGGj GCGTGCCTGCAGTCCCAGCTACTCAGGAGGCTGAGACG 
GGAGAATCGCTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCCAAGATCGCTCACTCCAGCCTAG 
CG ACAGA G T G AGAC T C CAT C T C A^ f T AJ^T^AMAAAT T AAT T AAT T AAAT T AAAT T 
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Fig. 5 

FADS 3 cDNA 

GGCCGCGGCGGCAGGGCGGGGCCGGAGCAGCGGGCGGCGGCGGAGGCGGCGCCCGGGAGCGCTC 

TTCGCTTCCCTCGGGGTCTTGCTCGGACCTCGGCCACCGCCTGGGATCCCCAGGACTCGTGCGT 

GCAGCATGGGCGGCGTCGGGGAGCCGGGACCGCGGGAGGGACCCGCGCAGCCGGGGGCACCGCT 

GCCCACCTTCTGCTGGGAGCAGATCCGCGCGCACGACCAGCCCGGCGACAAGTGGCTGGTCATC 

GAGCGCCGCGTCTACGACATCAGCCGCTGGGCACAGCGGCACCCAGGGGGCAGCCGCCTCATCG 

GCCACCACGGCGCTGAGGACGCCACGGATGCCTTCCGTGCCTTCCATCAAGATCTCAATTTTGT 

GCGCAAGTTCCTACAGCCCCTGTTGATTGGAGAGCTGGCTCCGGAAGAACCCAGCCAGGATGGA 

CCCCTGAATGCGCAGCTGGTCGAGGACTTCCGAGCCCTGCACCAGGCAGCCGAGGACATGAAGC 

TGTTTGATGCCAGTCCCACCTTCTTTGCTTTCCTACTGGGCCACATCCTGGCCATGGAGGTGCT 

GGCCTGGCTCCTTATCTACCTCCTGGGTCCTGGCTGGGTGCCCAGTGCCCTGGCCGCCTTCATC 

CTGGCCATCTCTCAGGCTCAGTCCTGGTGTCTGCAGCATGACCTGGGCCATGCCTCCATCTTCA 

AGAAGTCCTGGTGGAACCACGTGGCCCAGAAGTTCGTGATGGGGCAGCTAAAGGGCTTCTCCGC 

CCACTGGTGGAACTTCCGCCACTTCCAGCACCACGCCAAGCCCAACATCTTCCACAAAGACCCA 

GACGTGACGGTGGCGCCCGTCTTCCTCCTGGGGGAGTCATCCGTCGAGTATGGCAAGAAGAAAC 

GCAGATACCTACCCTACAACCAGCAGCACCTGTACTTCTTCCTGATCGGCCCGCCGCTGCTCAC 

CCTGGTGAACTTTGAAGTGGAAAATCTGGCGTACATGCTGGTGTGCATGCAGTGGGCGGATTTG 

CTCTGGGCCGCCAGCTTCTATGCCCGCTTCTTCTTATCCTACCTCCCCTTCTACGGCGTCCCTG 

GGGTGCTGCTCTTCTTTGTTGCTGTCAGGGTCCTGGAAAGCCACTGGTTCGTGTGGATCACACA 

GATGAACCACATCCCCAAGGAGATCGGCCACGAGAAGCACCGGGACTGGGTCAGCTCTCAGCTG 

GCAGCCACCTGCAACGTGGAGCCCTCACTTTTCACCAACTGGTTCAGCGGGCACCTCAACTTCC 

AGATCGAGCACCACCTCTTCCCCAGGATGCCGAGACACAACTACAGCCGGGTGGCCCCGCTGGT 

CAAGTCGCTGTGTGCCAAGCACGGCCTCAGCTACGAAGTGAAGCCCTTCCTCACCGCGCTGGTG 

GACATCGTCAGGTCCCTGAAGAAGTCTGGTGACATCTGGCTGGACGCCTACCTCCATCAGTGAA 

GGCAACACCCAGGCGGGCAGAGAAGGGCTCAGGGCACCAGCAACCAAGCCAGCCCCGGCGGGAT 

CGATACCCCCACCCCTCCACTGGCCAGCCTGGGGGTGCCCTGCCTGCCCTCCTGGTACTGTTGT 

CTTCCCCTCGGCCCCCTCACATGTGTATTCAGCAGCCCTATGGCCTTGGCTCTGGGCCTGATGG 

GACAGGGGTAGAGGGAAGGTGAGCATAGCACATTTTCCTAGAGCGAGAATTGGGGGAAAGCTGT 
TAT T T T TAT AT T AAAAT AC A TTCAGATGT 
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Fig. 6 



FAD SI 

Met Ala Pro Asp Pro Val Ala Ala Glu Thr Ala Ala Gin Gly Pro Thr 
15 10 15 

Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gin Arg Ser Gly Cys Glu 
20 25 30 

Glu Arg Trp Leu Val lie Asp Arg Lys Val Tyr Asn lie Ser Glu Phe 
35 4 0 45 

Thr Arg Arg His Pro Gly Gly Ser Arg Val lie Ser His Tyr Ala Gly 

50 60 

Gin Asp Ala Thr Asp Pro Phe Val Ala Phe His lie Asn Lys Gly Leu 
65 70 75 80 

Val Lys Lys Tyr Met Asn Ser Leu Leu lie Gly Glu Leu Ser Pro Glu 
85 90 95 

Gin Pro Ser Phe Glu Pro Thr Lys Asn Lys Glu Leu Thr Asp Glu Phe 
100 * 105 110 

Arg Glu Leu Arg Ala Thr Val Glu Arg Met Gly Leu Met Lys Ala Asn 
115 120 125 

His Val Phe Phe Leu Leu Tyr Leu Leu His lie Leu Leu Leu Asp Gly 
130 135 140 

Ala Ala Trp Leu Thr Leu Trp Val Phe Gly Thr Ser Phe Leu Pro Phe 
145 150 155 160 

Leu Leu Cys Ala Val Leu Leu Ser Ala Val Gin Ala Gin Ala Gly Trp 
165 170 175 

Leu Gin His Asp Phe Gly His Leu Ser Val Phe Ser Thr Ser Lys Trp 
180 " 185 190 

Asn His Leu Leu His His Phe Val lie Gly His Leu Lys Gly Ala Pro 
195 200 205 

Ala Ser Trp Trp Asn His Met His Phe Gin His His Ala Lys Pro Asn 

210 215 220 

Cys Phe Arg Lys Asp Pro Asp lie Asn Met His Pro Phe Phe Phe Ala 
225 * 230 235 240 

Leu Gly Lys lie Leu Ser Val Glu Leu Gly Lys Gin Lys Lys Lys Tyr 
245 250 ~ 255 

Met Pro Tyr Asn His Gin His Lys Tyr Phe Phe Leu lie Gly Pro Pro 
260 265 270 

Ala Leu Leu Pro Leu Tyr Phe Gin Trp Tyr lie Phe Tyr Phe Val lie 
275 280 285 
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Fig. 6 cont. 



Gin Axg Lys Lys Trp Val Asp Leu Ala Trp Met lie Thr Phe Tyr Val 
290 295 300 

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu 
305 3J0 315 320 

Gly Leu Phe Phe lie Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp 
325 330 335 

Val Thr Gin Met Asn His He Pro Met His He Asp His Asp Arg Asn 
340 345 350 

Met Asp Trp Val Ser Thx Gin Leu Gin Ala Thr Cys Asn Val His Lys 
355 360 365 

Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 
370 375 380 

His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Lys Val Ala 
385 390 395 400 

Pro Leu Va] Gin Ser Leu Cys Ala Lys His Gly He Glu Tyr Gin Ser 
4 05 " 410 415 

Lys Pro Leu Leu Ser Ala Phe Ala Asp He He His Ser Leu Lys Glu 
420 425 430 

Ser Gly Gin Leu Trp Leu Asp Ala Tyr Leu His Gin 
43i ' 440 
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Fig. 7 



FADS 2 

Met Gly Lys Gly Gly Asn Gin Gly Glu Gly Ala Ala Glu Arg Glu Vai 
15 10 15 

Ser Val Pro Thr Phe Ser Trp Glu Glu lie Gin Lys His Asn Leu Arg 
20 25 30 

Thr Asp Arg Trp Leu Val lie Asp Arg Lys Val Tyr Asn He Thr Lys 
35 40 45 

Trp Ser He Gin His Pro Gly Gly Gin Arg Val He Gly His Tyr Ala 

50 55 60 

Gly Glu Asp Ala Thr Asp Ala Phe Arg Ala Phe His Pro Asp Leu Glu 

65 70 75 60 

Phe Val Gly Lys Phe Leu Lys Pro Leu Leu He Gly Glu Leu Ala Pro 
85 90 95 

Glu Glu Pro Ser Gin Asp His Gly Lys Asn Ser Lys He Thr Glu Asp 
100 105 110 

Phe Arg Ala Leu Arg Lys Thr Ala Glu Asp Met Asn Leu Phe Lys Thr 
115 120 125 

Asn His Val Phe Phe Leu Leu Leu Leu Ala His He He Ala Leu Glu 
130 135 140 

Ser He Ala Trp Phe Thr Val Phe Tyr Phe Gly Asn Gly Trp He Pro 
145 150 155 160 

Thr Leu He Thr Ala Phe Val Leu Ala Thr Ser Gin Ala Gin Ala Gly 
165 170 175 

Trp Leu Gin His Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys 
180 165 190 

Trp Asn His Leu Val His Lys Phe Val He Gly His Leu Lys Gly Ala 
195 200 205 

Ser Ala Asn Trp Trp Asn His Arg His Phe Gin His His Ala Lys Pro 
210 215 220 

Asn He Phe His Lys Asp Pro Asp Val Asn Met Leu His Val Phe Val 

225 230 235 240 

Leu Gly Glu Trp Gin Pro He Glu Tyr Gly Lys Lys Lys Leu Lys Tyr 
245 250 255 

Leu Pro Tyr Asn His Gin His Glu Tyr Phe Phe Leu He Gly Pro Pro 
260 265 270 

Leu Leu He Pro Met Tyi Phe Gin Tyr Gin He He Met Thr Met He 
275 280 285 
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Fig. 7 cont. 



Val His Lys Asn Trp Val Asp Leu Ala Trp Ala Val Ser Tyr Tyr lie 
290 295 300 

Arg Phe Phe lie Thr Tyr He Pro Phe Tyr Gly He Leu Gly Ala Leu 

30b 310 31b 320 

Leu Phe Leu Asn Fhe He Arg Phe Leu Glu Ser His Trp Phe Val Trp 

32b 330 33b 

Val Thr Gin Met Asn His He Val Met Glu He Asp Gin Glu Ala Tyr 
340 345 350 

Arg Asp Trp Phe Ser Ser Gin Leu Thr Ala Thr Cys Asn Val Glu Gin 
355 3b0 36b 

Ser Phe Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gin He Glu 
370 375 380 

His His Leu Phe Fro Thr Met Pro Arg His Asn Leu His Lys He Ala 
305 390 395 400 

Pro Leu Val Lys Ser Leu Cys Ala Lys His Gly He Glu Tyr Gin Glu 
405 410 415 

Lys Pro Leu Leu Arg Ala Leu Leu Asp He He Arg Ser Leu Lys Lys 
420 425 430 

Ser Gly Lys Leu Trp Leu Asp Ala Tyr Leu His Lys 
435 440 
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Fig. 8 



FADS 3 

Met Gly Gly Val Gly Glu Pro Gly Pro Arg Glu Gly Pro Ala Gin Pro 
15 10 15 

Gly Ala Pro Leu Pro Thr Phe Cys Trp Glu Gin lie Arg Ala His Asp 
20 25 30 

Gin Pro Gly Asp Lys Trp Leu Val He Glu Arg Arg Val Tyr Asp He 
35 40 45 

Ser Arg Trp Ala Gin Arg His Pro Gly Gly Ser Arg Leu He Gly His 
50 55 * 60 

Kis Gly Ala Glu Asp Ala Thr Asp Ala Phe Arg Ala Phe His Gin Asp 
65 70 75 80 

Leu Asn Phe Val Arg Lys Phe Leu Gin Pro Leu Leu He Gly Glu Leu 
85 " 90 95 

Ala Pro Glu Glu Pro Ser Gin Asp Gly Pro Leu Asn Ala Gin Leu Val 
100 105 110 

Glu Asp Phe Arg Ala Leu His Gin Ala Ala Glu Asp Met Lys Leu Phe 
115 120 125 

Asp Ala Ser Pro Thr Phe Phe Ala Phe Leu Leu Gly His He Leu Ala 
130 135 140 

Met Glu Val Leu Ala Trp Leu Leu He Tyr Leu Leu Gly Pro Gly Trp 
145 150 155 160 

Val Pro Ser Ala Leu Ala Ala Phe He Leu Ala lie Ser Gin Ala Gin 
165 170 175 

Ser Trp Cys Leu Gin His Asp Leu Gly His Ala Ser lie Phe Lys Lys 
180 185 190 

Ser Trp Trp Asn His Val Ala Gin Lys Phe Val Met Gly Gin Leu Lys 

195 200 20^ 

Gly Phe Ser Ala His Trp Trp Asn Phe Arg His Phe Gin His His Ala 
210 215 220 

Lys Pro Asn He Phe His Lys Asp Pro Asp Val Thr Val Ala Pro Val 

225 230 235 240 

Phe Leu Leu Gly Glu Ser Ser Val Glu Tyr Gly Lys Lys Lys Arg Arg 
245 250 255 

Tyr Leu Pro Tyr Asn Gin Gin His Leu Tyr Phe Phe Leu lie Gly Pro 
260 265 270 

Pro Leu Leu Thr Leu Val Asn Phe Glu Val Glu Asn Leu Ala Tyr Met 
275 280 285 
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Fig. 8 cont. 



Leu Val Cys Met Gin Trp Ala Asp Leu Leu Trp Ala Ala Ser Phe 1 yr 
290 295 "* 300 

Ala Arg Phe Phe Leu Ser Tyr Leu Pro Phe Tyr Gly Val Pro Gly Val 

305 310 315 ' .-20 

Leu Leu Phe Phe Val Ala Val Arg Val Leu Glu Sei His Trp Phe Val 
325 " 330 335 

Trp lie Thr Gin Met Asn His lie Pro Lys Glu Gly His Glu lys 

340 345 350 

His Arg Asp Trp Val Ser Ser Gin Leu Ala Ala Thr Cys Asn Val c-'lu 
355 360 365 

Pro Ser Leu Phe Thr Asn Trp Phe Ser Gly His Leu Asn Phe Gin 3 le 
370 375 380 

Glu His His Leu Phe Pro Arg Met Pro Arg His Asr. Tyr Ser Arg \'al 
385 390 395 " 400 

Ala Pro Leu Val Lys Ser Leu Cys Ala Lys His Gly Leu Ser Tyr Glu 
405 410 415 

Val Lys Pro Phe Leu Thr Ala Leu Val Asp lie VaJ Arg Ser Leu Lys 
420 425 430 

Lys Ser Gly Asp lie Trp Leu Asp Ala Tyr Leu His Gin 
435 440 445 
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Fig. 9 

Oligonucleotide primers to amplify FADS1 cDNA 

TU12-R5 (5 ? -CGCCTGACAGCCCCTGCT-3 ' ) 
TU12-F10 (5'--CAGGTGGCCAATCACAAAAT-3 T ) 

TU12-R7 ( 5 • -CTCAAAGTGGAACCATCTGCTA-3 1 ) 
TU12-F9 f 5 1 -GGAAACCCAGTCCATGTTCC-3 f ) 

TU12-R6 ( 5 ' -CCTGGGCCTTTTCTTCATAGT- 3 ' ) 
TU12-F5 (5 1 -CTCAAGCTCCCCTCTGCCT-3' ) 



Oligonucleotide primers to amplify FADS2 cDNA 

TU13-R4 ( 5 1 -TCAGAAGCATAACCTGCGC-3 1 ) 

TU13-F7 { 5 1 -CCAGTTCACCAATCAGCAGG-3 ■ ) 

TU13-R3 ( 5 ' -CCCCTGCTGATTGGTGAACT-3 ' ) 

TU13-F4 ( 5 ' -TGTAGGGCAGGTATTTCAGC-3 ' ) 

TU13-R2 f 5 9 - AGC C CAT C GAG T AC G G C AA- 3 1 ) 

TU13-F1 {5 , -CCTCAGAACAAAAGCCCATC-3 f ) 



Oligonucleotide primers to amplify FADS3 cDNA 

TU19-R2 (S'-TCTTGCTCGGACCTCGGC-S 1 ) 
TU19-F2 ( 5 ' -GTGATCCACACGAACCAGTG-3 ' ) 

TU1 9-R3 ( 5 f -GAAGAACCCAGCCAGGATG-3 f ) 
TU19-F3 (S'-ACAGCTTTCCCCCAATTCTC-S 1 ) 
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The Search Division considers that the present European patent application does not comply with the 
requirements of unity of invention and relates to several inventions or groups of inventions, namely: 

1. Claims: 1-18 partially 

An isolated polynucleotide selected from the group 
consisting of 

polynucleotides having at least 65%, preferably 80% homology 
with a polynucleotide encoding a polypeptide of SEQ ID N0:4, 
comprising variants, under stringent conditions hybridizing 
molecules, complementary molecules, and oligonucleotides 
comprising at least 15 consecutive nucleotides of said 
sequence, preferably the polynucleotide of SEQ ID N0:1. 
Vectors, host cells, and transgenic organisms comprising 
said sequences. 

A polypeptide comprising a sequence having at least 65%, 
more preferably 85% homology to SEQ ID N0:4, variants 
thereof, and a peptide comprising at least 15 consecutive 
amino acids thereof. A process for producing said 
polypeptide using said host cells and DNA sequences. 
Antibodies against said polypeptides, and their use in 
diagnosis and therapy. 

An oligonucleotide primer having a sequence selected from 
the group of nucleotide sequences of SEQ ID N0s:7-12. 
A method of screening for modulators in known assays using 
constructs or of screening for interacting proteins or 
factors using state of the art technologies, as well as a 
method of screening chemical libraries comprising 
transformed cell lines, both methods employing the said 
sequences, vectors, or host cells. 

A compound which alters or reacts with at least one epitope 
of the proteins and which is obtained by said methods. 
Pharmaceutical compositions comprising as an effective 
component an effective amount of said peptides. 



2. Claims: 1-18 partially 

idem for SEQ ID N0s:2.5, 13-18 



3. Claims: 1-18 partially 

idem for SEQ ID N0s:3,6, 19-22 
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